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© A simple and highly efficient method for cloning cDNAs from mammalian expression libraries based on 
2! transient expression in mammalian host cells has been discovered. Novel expression vectors allowing highly 
^efficient construction of mammalian cDNA libraries are disclosed. The cloning method of the invention which has 
^been used to clone genes for cell surface antigens of human lymphocytes, has general application in gene 
0> cloning. Cell surface antigens cloned according to the present invention have been purified, and the nucleotide 
*^and amino acid sequences determined. These antigens have diagnostic and therapeutic utility in immune- 
O mediated infections in mammals, including humans. 
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RAPID IMMUNOSELECTION CLONING METHOD 



Background 

A basic tool in the field of recombinant genetics is the conversion of poly(A)* mRNA to double-stranded 
(ds) cDNA, which then can be inserted into a cloning vector and expressed in an appropriate host cell. 

s Molecular cloning methods for ds cDNA have been reviewed, for example by Williams, "The Preparation 
and Screening of a cDNA Clone Bank," in Williamson, ed., Genetic Engineering , Vol. 1, p. 2, Academic 
Press, New York (1981); Maniatis, "Recombinant DNA", in Prescott ed., Cell Biology , Academic Press, New 
York (1980); and Efstratiadis et aL "Cloning of Double-Stranded DNA," in Stelo et aL, Genetic Engineering , 
Vol. 1, p. 15, Plenum Press, New York (1979). 

w A substantial number of variables affect the successful cloning of a particular gene and cDNA cloning 
strategy thus must be chosen with care. A method common to many cDNA cloning strategies invojves the 
construction of a "cDNA library" which is a collection of cDNA clones derived from the total po!y(A) + mRNA 
derived from a cell of the organism of interest. 

A mammalian cell may contain up to 30,000 different mRNA sequences, and the number of clones 

is required to obtain low-abundance mRNAs, for example, may be much greater. Methods of constructing 
genomic eukaryotic DNA libraries in different expression vectors, Including bacteriophage X , cosmids, and 
viral vectors, are known. Some commonly used methods are described, for example, in Maniatis et aL, 
Molecular Cloning: A Laboratory Manual , Cold Spring Harbor Laboratory, publisher, Cold Spring Harbor, 
New York (1982). 

20 Once a genomic cDNA library has been constructed, it is necessary to isolate from the thousands of 
host cells the cell containing the particular human gene of interest. Many different methods of isolating 
target genes from cDNA libraries have been utilized, with varying success. These include, for example, the 
use of nucleic acid probes, which are labeled mRNA fragments having nucleic acid sequences complemen- 
tary to the DNA sequence of the target gene. When this method is applied to cDNA clones of abundant 

25 mRNAs in transformed bacterial hosts, colonies hybridizing strongly to the probe are likely to contain the 
target DNA sequences. The identity of the clone then may be proven, for example, by in situ 
hybridization/selection (Goldberg et aL, Methods Enzymol. , 68:206 (1979)) hybrid-arrested translation 
(Paterson et aL, Proceedings of the National Academy of Sciences , 74:4370 (1977)), or direct DNA 
sequencing (Maxam and Gilbert, Proceedings of the National Academy of Sciences , 74:560 (1977); Maat 

30 and Smith, Nucleic Acids Res. , 5:4537 (1 978)). 

Such methods, however, have major drawbacks when the object is to clone mRNAs of relatively low 
abundance from cDNA libraries. For example, using direct in situ colony hybridization, it is very difficult to 
detect clones containing cDNA complementary to mRNA species present in the initial library population at 
less than one part in 200. As a result, various methods for enriching mRNA in the total population (e.g. size 

35 fractionation, use of synthetic oligodeoxynucleotides, differential hybridization, or immunopurification) have 
been developed and are often used when low abundance mRNAs are cloned. Such methods are described, 
for example, in Maniatis et aL, Molecular Cloning: A Laboratory Manual , supra . 

Many functional eukaryotic proteins initially exist in the form of precursor molecules which contain 
leader or signal sequences at their N-terminal ends. These leader sequences bind to the cell membrane 

40 and draw the remainder of the protein through the lipid bi layer, after which the signal sequence is cleaved 
from the protein by a signal peptidase enzyme. The protein thus functions only after secretion from the 
ceils (for example, insulin, serum albumin, antibodies, and digestive tract enzymes), or after the proteins 
have been anchored to the outer surface of a cell membrane (for example, histocompatibility antigens). 
The cell surface antigens characteristic of mammalian T lymphocytes are additional examples of 

45 proteins that anchor to the cell surface. In mammals, certain cells derived from bone marrow mature into 
lymphocytes, which are present in the lymphoid organs, including the thymus, spleen, lymph nodes, and 
lymphoid aggregates, and also circulate actively through the blood and lymph systems. Mature lymphocyte 
cells may be divided into two populations: thymus-dependent (T) lymphocytes and thymus-independent (B) 
lymphocytes. T lymphocytes migrate to the interior of the thymus, where they undergo differentiative 

so proliferation. During their differentiation process, they express characteristic cell surface membrane alloan- 
tigens, including Thy-1, TLA, gv-1, Ly-1, Ly-2, Ly-3, and Ly-5. As they mature, T lymphocytes lose thaTLA 
antigens and some of the Thy-1 antigens, and gain histocompatibility antigens, acquiring the membrane 
conformation typical of the recirculating T lymphocytes. This is described, for example, by Mota, "Activity 
of Immune Cells," in Bier et aL, eds., Fundamentals of Immunology , 2d Ed., Springer- Verlag, Berlin, pp. 35- 
62(1986). 
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T lymphocytes are involved indirectly in the formation of antibodies and their activities thus have 
required complex analysis of cell function, rather than simple antibody titer measurement Partly due to this, 
their importance in development of immunologic competence was not recognized until relatively recently. 
Mature T lymphocytes synthesize and express an unique pattern of surface glycoprotein antigens which 

5 serve as markers for identification of different T lymphocyte subpopulations, including T helper ceils, T 
suppressor cells, and T cytotoxic cells. Each of these subpopulations plays a very important role in 
regulating the immune system. (Mota, supra) . 

In humans, the functional and phenotypic heterogeneity of T lymphocytes is well accepted. Two major 
subpopulations are known: effector T cells mediating cellular immunity; and regulator T cells containing 

10 helper and suppressor T lymphocytes. These two subpopulations have been defined with heteroantisera, 
autoantibodies, and monoclonal antibodies directed at cell surface antigens. For example, earlier in their 
development, human lymphoid cells in the thymus express an antigen designated T1 1 which reacts strongly 
to a monoclonal antibody designated Cluster of Differentiation 2 (CD2), and react slightly with monoclonal 
antibody CDS to cell surface antigen T1. During maturation, these cells lose T11 (CD2) and acquire three 

rs new antigens defined by monoclonal antibodies CD4, CD8, and CD1. With further maturation, the 
thymocytes cease to express cell surface antigens reactive with monoclonal antibody CD1 , express the T3 
antigen reactive with monoclonal antibody CD3, and then segregate into two subpopulations which express 
either T4 (CD4) or T8 (CD8) antigen. Immunologic competence is acquired at this stage, but is not 
completely developed until thymic lymphocytes migrate outside the thymus. (Mota, supra .) In contrast with 

20 the majority of thymocytes, circulating T lymphocytes express the T1 (CD5) and T3 (CD3) antigens. The T4 
(CD4) antigen is present on approximately 55-65% of peripheral T lymphocytes, whereas the T8 (CD8) 
antigen is expressed on 20-30%. These two subpopulations correspond to helper and to suppressor and 
cytotoxic T cells, respectively. 

In addition to providing a convenient means of distinguishing T lymphocyte subpopulations, these cell 

25 surface antigens are important for mature T cell activation and effector function. T cell activation involves a 
complex series of cell surface interactions between the T ceil and the target cell or stimulator cell in 
addition to binding of the T cell receptor to its specific antigen. 

For example, CD2, the human T cell erythrocyte receptor, allows thymocytes and T-lymphocytes to 
adhere to target cells (e.g., erythrocytes) and to thymic epithelium. This occurs via a specific molecular 

30 ligand for CD2, designated LFA-3, in humans, which is a widely distributed surface antigen. This 
phenomenon has long been employed to detect, assay and purify human cells producing antibodies to 
sheep erythrocytes and serves as the basis for the E-rosette test, first described by Zaalberg, Nature , 
202:1231 (1964). CD2/LFA-3 interactions also have been shown to mediate cytolytic target conjugation 
(Shaw et aL, Nature , 323:262-264 (1986), and the mixed lymphocyte reaction (Martin et aL, J. Immunol. , 

35 131:180-185 (1983). Anti-CD2 monoclonal antibodies can directly activate peripheral T-lymphocytes via an 
antigen-independent pathway (Meuer et at, Cell, 36:897-906 (1984), indicating an even wider im- 
munoregulatory role for CD2. 

Recognition that T lymphocytes are the main effectors of cell-mediated immunity and also are involved 
as helper or suppressor cells in modulating the immune response has resulted in a significant contribution 

40 to the increasing practical application of clinical immunology to medicine. The scope of this application 
includes defense against infections, prevention of diseases by immunization, organ transplantation, blood 
banking, deficiencies of the immune system, and a variety of disorders that are mediated by immunologic 
mechanisms. Moreover, immunologic techniques frequently are used in the clinical laboratory, as in the 
measurement of hormones and drugs. Clinical immunology is described, for example, in Weir, ed., 

45 Handbook of Experimental Immunology in Four Volumes: Volume 4^ Applications of Immunological Methods 
in Biomedical Sciences , 4th Ed., Blackweil Scientific Publications, Oxford (1986); Boguslaski et aL, eds., 
Clinical Immunochemistry: Principles of Methods and Applications , Little, Brown & Co., Boston (1984); 
Holborow et aL, eds., Immunology in Medicine: A Comprehensive Guide to Clinical Immunology , 2d Ed., 
Grune & Stratton, London (1983); and Petersdorf et aL, eds., Harrison's Principles of Internal Medicine . 10th 

so ed., McGraw-Hill, New York, publisher, pp. 344-391 (1983). Clearly, a more thorough understanding of the 
proteins which mediate the immune system would be of significant value in clinical immunology. 

Use of mammalian expression libraries to isolate cDNAs encoding mammalian proteins such as those 
described above would offer several advantages. For example, the protein expressed in a mammalian host 
cell should be functional and should undergo any normal posttranslationa! modification. A protein ordinarily 

55 transported through the intracellular membrane system to the cell surface should undergo the complete 
transport process. A mammalian expression system also would allow the study of intracellular transport 
mechanisms and of the mechanism that insert and anchor cell surface proteins to membranes. 

One common mammalian host cell, called a "COS" cell, is formed by infecting monkey kidney cells 
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with a mutant viral vector, designated simian virus strain 40 (SV40), which has functional early and late 
genes, but lacks a functional origin of replication. In COS cells, any foreign DNA cloned on a vector 
containing the SV40 origin of replication will replicate because SV40 T antigen is present in COS cells. The 
foreign DNA wil! replicate transiently, independently of the cellular DNA. 

5 With the exception of some recent lymphokine cDNAs isolated by expression in COS cells (Wong, 
G.G., et aL, Science, 228:810-815 (1985); Lee, F. et aL, Proceedings of the National Academy of Sciences, 
USA, 83-2061 -2065 (1986); Yokota, T., et aL, Proceedings of the National Academy of Sciences, USA , 
83-589T5898 (1986); Yang, Y., et aj., Cell , 47:3-10 (1986)), however, few cDNAs in general are isolated from 
mammalian expression libraries. There appear to be two principal reasons for this: First, the existing 

w technology (Okayama, H. et aL, MoL Cell. Biol. , 2:161-170 (1982)) for construction of large plasmid libraries 
is difficult to master, and library size rarely approaches that accessible by phage cloning techniques. 
(Huynh. T. et a!., In: DNA Cloning Vol. I, A Practical Approach , Glover, D.M. (ed.), IRL Press, Oxford (1985), 
pp. 49-78). Second, the existing vectors are, with one exception (Wong G.G., et aL, Science , 228:810-815 
(1985)), poorly adapted for high level expression, particularly in COS cells. The reported successes with 

is lymphokine cDNAs do not imply a general fitness of the methods used, since these cDNAs are particularly 
easy to isolate from expression libraries. Lymphokine bioassays are very sensitive {(Wong, G.G., et aL, 
Science , 228:810-815 (1985); Lee, F. et aL, Proceedings of the National Academy of Sciences, USA , 
83:2061-2065 (1986); Yokota T. et aL, Proceedings of the National Academy of Sciences, USA , 83:5894- 
5898 (1986); Yang, Y. et aL, Cell, 47:3-10 (1986)) and the mRNAs are typically both abundant and short 

20 (Wong, G.G. et ai., Science , 228:810-815 (1985); Lee, F., et aL, Proceedings of the National Academy of 
Sciences, USAT 83:2061-2065 (1986); Yokota, T., et aL, Proceedings of the National Academy of Sciences, 
USA , 83:5894-5898 (1986); Yang, Y., et aL, Cell, 47:3-10 (1986)). 

Thus, expression in mammalian hosts previously has been most frequently employed solely as a 
means of verifying the identity of the protein encoded by a gene isolated by more traditional cloning 

25 methods. For example, Stuve et aL, J. Virol. , 61 (2) :327-335 (1987), cloned the gene for glycoprotein gB2 of 
herpes simplex type II strain 333 by plaque hybridization of Ml3-based recombinant phage vectors used to 
transform competent E. coli JM101. The identity of the protein encoded by the clone thus isolated was 
verified by transfection of mammalian COS and Chinese hamster ovary (CHO) cells. Expression was 
demonstrated by immunofluorescence and radioimmunoprecipitation. 

so Oshima et aL used plaque hybridization to screen a phase iambdagtH cDNA library for the gene 
encoding human placental beta-glucuronidase. Oshima et aL, Proceedings of the National Academy of 
Sciences, U.S.A., 84:685-689 (1 987). The identity of isolated cDNA clones was verified by immunoprecipita- 
tion of the protein expressed by COS-7 cells transfected with cloned inserts using the SV40 late promoter. 
Transient expression in mammalian cells has been employed as a means of confirming the identity of 

35 genes previously isolated by other screening methods. Gerald et aL, Journal of General Virology , 67:2695- 
2703(1986). Mackenzie, Journal of Biological Chemistry , 262:14112-14117 (1986); Seif et aL, Gene , 
43:1 1 1 1-1 121 (1 986); Orkin et aL, Molecular and Cellular Biology , 5(4):762-767 (1985). These methods often 
are inefficient and tedious and require multiple rounds of screening to identify full-length or overlapping 
clones. Prior screening methods based upon expression of fusion proteins are inefficient and require large 

40 quantities of monoclonal antibodies. Such drawbacks are compounded by use of inefficient expression 
vectors, which result in protein expression levels that are inadequate to enable efficient selection. 



Summary of the Invention 

45 

The present invention relates to a powerful new method for cloning cDNA encoding cell surface 
antigens, to a method of construction cDNA libraries to high efficiency expression vectors particularly suited 
for high level expression in eukaryotic host cells, and to the isolated nucleotide sequences and their 
encoded products. 

so The highly efficient cloning technique of the present invention is based upon transient expression of 
antigen in eukaryotic ceils and physical selection of cells expressing the antigen by adhesion to an 
antibody-coated substrate, such as a culture dish. The methods of the present invention are useful for the 
isolation and molecular cloning of any protein which can be expressed and transported to the cell surface 
membrane of a eukaryotic cell. 

55 The method for cloning cDNA encoding a cell surface antigen of the present invention comprises 
preparing a cDNA library; introducing this cDNA library into eukaryotic mammalian preferably tissue culture 
ceils; culturing these cells under conditions allowing expression of the cell surface antigen; exposing the 
ceils to a first antibody or antibodies directed against the cell surface antigen, thereby allowing the 
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formation of a cell surface antigen-first antibody complex; subsequently exposing the cells to a substrate 
coated with a second antibody directed against the first antibody, thereby causing cells expressing the celi 
surface antigen to adhere to the substrate via the formation of a cell surface antigen-first antibody-second 
antibody complex; and separating adherent from non-adherent cells. 

5 By means of the cloning method of the present invention, isolation and molecular cloning of genes 
encoding such cell surface antigens as the following have been accomplished: the CHI a, CD1b, CD1c, 
CD2, CD6, CD7, CD13, CD14, CD16, CD19, CD20, CD22, CD26, CD27, CD28, CD31, CDw32a, CDw32b, 
CD33, CD34, CD36, CD37, CD38, CD39, CD40, CD43, ICAM, LFA-3, FcRIa, FcRIb, TLiSa, and Lev-8 
antigens. The nucleotide sequences of genes cloned by the method of the present invention have been 

10 determined and the amino acid sequences of the encoded proteins have been identified. A cloned gene, 
such as that encoding CD1a, CD1b, CD1c, CD2, CD6, CD7, CD13, CD14, CD16, CD19, CD20, CD22, CD26, 
CD27, CD28, CD31 , CDw32a, CDw32b, CD33, CD34, CD36, CD37, CD38, CD39, CD40, CD43, ICAM, LFA- 
3, FcRIa, FcRIb, TLiSa, and Lev-8, is also the subject of the present invention. 

Once the gene encoding an antigen has been cloned according to the method of the present invention, 

is that gene can be expressed in a prokaryotic or a eukaryotic host cell to produce the encoded protein or 
portion thereof in substantially pure form such as it does not exist in nature. Another aspect of the present 
invention relates to substantially pure celi surface antigens, particularly: CD! a, CD1b, CD2, CD6, CD7, 
CD13, CD14, CD16, CD19, CD20, CD22, CD26, CD27, CD28, CD31, CDw32a, CDw32b, CD33, CD34, 
CD36, CD37, CD38, CD39, CD40, CD43, ICAM, LFA-3, FcRIa, FcRIb, TLiSa, and Lev-8 antigens and their 

20 functional analogues and equivalents. The primary amino acid sequences of the CDla, CDlb, CD2, CD7, 
CD14, CD16, CD19, CD20, CD22, CD28, CDw32a, CDw32b, CD33, CD34, CD40, ICAM, LFA-3, FcRIa, 
FcRIb and TLiSa antigens have been determined. The invention thus also relates to the amino acid 
sequences of those antigens and their function equivalents and to the nucleotide sequences encoding those 
antigens. 

25 This invention also relates to high efficiency cDNA expression vectors which allow the generation of 
very large mammalian expression libraries and yield large amounts of protein in mammalian host cells, 
resulting in efficient selection. In a particular embodiment of this invention, a cDNA expression vector 
comprises a suppressor tRNA gene; an SV40 origin; a synthetic transcription unit, comprising a chimeric 
promotor composed of human cytomegalovirus AD169 immediate early enhancer sequencer fused to the 

30 HIV LTR -60 to* +80 sequences, inserted between the suppressor tRNA gene and the SV40 origin; a 
polylinker comprising two BstXI sites separated by a replaceable DNA sequence and flanked by Xba l sites; 
and an SV40 small t antigeTTspIice and early region polyadenylation signals. 

A further aspect of the present invention comprises a synthetic transcription unit for use in a cDNA 
expression vector, comprising a chimeric promoter composed of human cytomegalovirus AD169 immediate 

35 early enhancer sequences fused to HIV LTR -60 to +80 sequences. The small size and particular 
arrangement of the sequences of the cDNA expression vector of the present invention allow highly efficient 
replication in host mammalian tissue culture cells, such as COS ceils. Moreover, this vector employs a 
polylinker containing two inverted BstXI sites separated by a short replaceable DNA segment, which allows 
the use of a very efficient oligonucieotide-based cDNA insertion strategy. 

40 In another aspect, the present invention comprises a vector comprising two identical BstXI sites in 
inverted orientation each with respect to the other, which BstXI sites are separated by a short replaceable 
DNA fragment. Another aspect of the invention is a polylinker as described above. 

A further aspect of the invention relates to an oligonucleotide-based cDNA insertion method, comprising 
ligating synthetic DNA oligonucleotides to the cDNA segment desired to be inserted into a vector, the 

45 synthetic DNA oligonucleotides giving the same terminal sequences as those of the short replaceable DNA 
fragment of the polylinker of the invention, and inserting the resulting cDNA segment plus synthetic DNA 
oligonucleotide terminal sequences into the polylinker of the vector, from which the short replaceable DNA 
fragment previously has been removed. 

In preparing cDNA libraries according to the present invention, it has been discovered that many tumors 

so are heavily infiltrated by macrophages and lymphocytes, and thus may be employed as a source of 
macrophage or lymphocyte transcripts to good effect, instead of tumor cell lines commonly used. In another 
aspect, then, the present invention relates to the use of tumor cells, particularly human tumor cells, to 
prepare cDNA libraries for use according to the methods of the present invention. 

Another advantage of the powerful selection system of the present invention is that directional insertion 

55 of the cDNA is not necessary. The method of the present invention results in library construction 
efficiencies which are on a par with those described for phage vectors such as X gt10 and x gt11, with the 
additional advantage that clones generated according to the methods of the present invention are easier to 
manipulate. 
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The immunoselection technique of the present invention allows efficient use of antibodies, which may 
be monoclonal or polyclonal, in relatively small absolute amounts. The method of the present invention also 
is quite rapid. Generally, three or fewer cycles of immunoselection and rescue are required to isolate a 
target cDNA clone. Thus, the method of the present invention also results in the efficient use of labor and 

5 materials when cloning genes encoding cell surface antigens. As described above, this method has been 
employed to successfully clone genes encoding cell surface antigens associated with mammalian T 
lymphocytes (e.g. antigens CD1a, CD1b, C01c, CD2, CD6, CD7, CD13, CD14, CD16, CD19, CD20, CD22, 
CD26, CD27, CD28, CD31, CDw32a, CDw32b, CD33, CD34, CD36, CD37, CD38, CD39, CD40, CD43, ICAM, 
LFA-3, FcRIa, FcRIb, TLiSa, and Lev-8). 

io The purified genes and proteins of the present invention are useful for immunodiagnostic and 
immunotherapeutic applications, including the diagnosis and treatment of immune-mediated infections, 
diseases, and disorders in animals, including humans. They can also be used to identify, isolate and purify 
other antibodies and antigens. Such diagnostic and therapeutic uses comprise yet another aspect of the 
present invention. Moreover, the substantially pure proteins of the present invention may be prepared as 

is medicaments or pharmaceutical compositions for therapeutic administration. The present invention further 
relates to such medicaments and compositions. 



Brief Description of the Drawings 

20 

Figure 1. Nucleotide sequence of expression vector piH3 

Nucleotides 1-589 are derived from pMB1 origin (pBR322 ori); nucleotides 590-597 are derived from 
25 the Sacll linker (ACCGCGT); nucleotides 598-799 are derived from the synthetic tyrosine suppressor tRNA 
gene (supF gene); nucleotides 800-947 are derived from a remnant of the ASV LTR fragment (Pvull to 
Mlul); nucleotides 948-1500 are derived from the human cytomegalovirus AD169 enhancer, nucleotides 
1501-1650 are derived from HIV TATA and tat-responsive elements; nucleotides 1651-1716 are derived 
from the piLNXAN polylinker (Hind II I to Xba ); nucleotides 1717-2569 are derived from pSV to splice and 
30 poly-Addition signals; nucleotides 2570-2917 are derived from the SV40 origin of replication (pvu ll to 
(Hindi!!); and nucleotides 2918-2922 are derived from piVX, remnant of R1 site from polylinker. 



Figure 2. Nucleotide sequence of the CD2cDNA insert 

35 

Nucleotide numbering is given in parentheses at right, amino acid numbering, center and left. Locations 
of the potential sites for addition of asparagine-l inked carbohydrate (CHO) are shown, as well as the 
predicted transmembrane (TM) sequence. The amino acid sequence is numbered from the projected 
cleavage site of the secretory signal sequence. Proline residues are underscored with asterisks. The 
40 presumed polyadenylation signal for the 1.65kb transcript, and a possible polyadenylation signal for the 
1 .3kb transcript, are shown boxed. 



Figure 3. Restriction map of the CDM8 expression vector 

45 

The CDM8 vector includes a deleted version of a mutant polyoma virus early region selected for high 
efficiency expression in both murine and monkey cells. Substantially all of the human immunodeficiency 
promoter region has been replaced with the cognate sequences of the human cytomegalovirus immediate 
early promoter, and by inclusion of a bacteriophage T7 promoter between the eukaryotic promoter and the 
so site of cDNA insertion. Arrows indicate the direction of transcription. 



Figure 4. Nucleotide sequence and corresponding amino acid sequence of the LFA-3 antigen 

55 WOP cells transfected with a clone encoding the LFA-3 antigen were detected by indirect im- 
munofluorescence, amplified and sequenced. (A) shows the 874 base pair insert containing an open reading 
frame of 237 residues originating at a methionine codon, and terminating in a series of hydrophobic 
residues. Hydrophobic and hydrophilic regions within this open reading frame are shown in (B). 
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Figure 5. Restriction Map of the piH3M vector 

The direction of transcription is indicated by an arrow. Restriction endonuclease sites flanking the BstX I 
cloning sites are shown. 

s 

Figure 6. Nucleotide sequence of the piH3M vector 

There are 7 segments. Residues 1-587 are from the pBR322 origin of replication, 588-1182 from the 
10 M13 origin, 1183-1384 from the supF gene, 1385-2238 are from the chimeric cytomegalovirus/human 
immunodeficiency virus promoter, 2239-2647 are from the replaceable fragment, 2648-3547 from plasmid 
pSV2 (splice and polyadenylation signals), and 3548-3900 from the SV40 virus origin. 



15 Figure 7. Nucleotide sequence of the CD28 cDNA 

Nucleotide numbering is given in parentheses at right, amino acid numbering, center and left. Location 
of the potential sites for addition of asparagine- linked carbohydrate (CHO) are shown, as well as the 
predicted transmembrane (TM) sequence. The amino acid sequence is numbered from the projected 
20 cleavage site of the secretory signal sequence. The presumed polyadenylation signal is shown boxed. 



Figure 8. Nucleotide sequence of the CD7 cDNA insert 

25 Nucleotide numbering is given in parentheses at right The amino acid sequence is numbered from the 
projected cleavage site of the secretory signal sequence. Splice donor and acceptor sites indicated by (/). 
The location of the potential sites for addition of asparagine-l inked carbohydrate (CHO) are shown, the 
potential fatty acid esterification site is denoted {*). and tne predicted transmembrane domain (TM) is 
underlined. Nucleotide sequences potentially involved in hairpin formation are denoted by {.), The presumed 

30 polyadenylation signal is underlined. 



Figure 9, Nucleotide sequence of the CDw32 cDNA 

35 Nucleotide number is given in the parenthesis at right, amino acid numbering, center and left. Locations 
of the potential sites for addition of asparagine-linked carbohydrate (CHO) are shown, as well as the 
predicted transmembrane (TM) sequence. The amino acid sequence is numbered from the projected 
cleavage site of the secretory signal sequence. Cysteine residues are underscored with asterisks. 

40 

Figure 10. Sequence of the CD20.4 cDNA 

A. The sites of potentional N-linked glycosylation are denoted by the symbol -CHO-: the hydrophobic 
regions are underscored. The site of the poly(A) + tail in clone CD20.6 is denoted by an asterisk. 
45 B. Hydrophobicity profile of the amino acid sequence in A. 



Figure 1 1 . Sequence of ICAM-1 

so Complete nucleotide sequence of ICAM-1 cDNA insert and predicted protein sequence. Nucleotide 
numbering is at right, amino acid numbering in parentheses at left. The RGE motif at position 152 is 
underlined, the potentional N-linked glycosylation sites are indicated by -CHO- and the transmembrane 
domain by -TM-. The amino acid sequence is numbered from the projected cleavage site of the signal 
peptide. Sequencing was by dideoxy-chain termination (Sanger, F., et aL, Proc. Natl. Acad. Sci. USA , 

55 74:5463-5467 (1977)), using a combination of subclones, and specific oligonucleotides. 



Figure 1 2. Nucleotide sequence of CD19 
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Figure 13. Nucleotide sequence of CD20 



Figure 1 4. Nucleotide sequence of CD22 

5 ~~ 

Figure 15. Nucleotide sequence of CDw32a 
10 Figure 16. Nucleotide sequence of CDw32b 



Figure 1 7. Nucleotide sequence of CD40 

15 

Detailed Description of the Invention 

This invention relates to a novel method for cloning cDNA encoding a cell surface antigen and to a 
method of constructing cDNA libraries. It also relates to particular cDNA expression vectors and compo- 
20 nents thereof, nucleotide sequences or genes isolated by the method, substantially pure cell surface 
antigens encoded by the cDNA segments, and methods of using the isolated nucleotide sequences and 
encoded products. 

In the following description, reference will be made to various methodologies known to those of skill in 
the art of recombinant genetics. Publications and other materials setting forth such known methodologies to 
25 which reference is made are incorporated herein by reference In their entireties. 

Standard reference works setting forth the general principles of recombinant DNA technology include 
Darnell, J.E. et al., Molecular Cell Biology , Scientific American Books, Inc., publisher, New York, N.Y. 
(1986); LewinrBlM., Genes H, John Wiley & Sons, publisher, New York, N.Y. (1985); Old, R.W. et ah, 
Principles of Gene Manipulation: An Introduction to Genetic Engineering , 2d edition, University of California 
30 Press, Berkeley, CA (1981); and Maniatis, T. et aL, Molecular Cloning: A Laboratory Manual , Cold Spring 
Harbor, NY (1982). 

By "cloning" is meant the use of in vitro recombination techniques to insert a particular gene or other 
DNA sequence into a vector molecule. In order to successfully clone a desired gene, it is necessary to 
employ methods for generating DNA fragments, for joining the fragments to vector molecules, for 

35 introducing the composite DNA molecule into a host cell in which it can replicate, and for selecting the 
clone having the target gene from amongst the recipient host cells. 

By "cDNA" is meant complementary or copy DNA produced from an RNA template by the action of 
RNA-dependent DNA polymerase (reverse transcriptase). Thus a "cDNA clone" means a duplex DNA 
sequence complementary to an RNA molecule of interest, carried in a cloning vector. 

40 By "cDNA library" is meant a collection of recombinant DNA molecules containing cDNA inserts which 
together comprise the entire genome of an organism. Such a cDNA library may be prepared by art- 
recognized methods described, for example, in Maniatis et al., Molecular Cloning: A Laboratory Manual , 
supra. Generally, RNA is first isolated from the cells of an organism from whose genome it is desired to 
clone a particular gene. Preferred for the purposes of the present invention are mammalian, and particularly 

45 human, cell lines. More preferred are the human tumor cell line HPB-ALL and the human lymphoblastoid 
cell line JY. Alternatively, RNA can be isolated from a tumor cell, derived from an animal tumor, and 
preferably from a human tumor. Thus, a library may be prepared from, for example, a human adrenal 
tumor, but any tumor may be used. 

The immunoselection cloning method of the present invention comprises the preparation of a cDNA 

so library by extracting total RNA including a particular gene from a cell, synthesizing a series of complemen- 
tary double-stranded cDNA fragments from the RNA and introducing these cDNA fragments into mamma- 
lian cells in tissue culture. The mammalian cells are maintained under conditions which allow them to 
express the protein (i.e. the cell surface antigen). The resulting cells are exposed to a first antibody or pool 
(group) of antibodies directed against the cell surface antigen. This results in formation of a cell surface 

55 antigen-first antibody complex. The complexes are exposed to a substrate to which is coated or bound a 
second antibody directed against the first antibody. Cells expressing the ceil surface antigen adhere to the 
substrate (because of formation of a cell surface antigen-first antibody-second antibody complex). Adherent 
ceils are separated from non-adherent cells. 
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Isolation of total RNA 

The guanldium thiocyanate/CsCI method of isolating total RNA Is preferred. More preferred is a 
guanidium thiocyanate/UCI variant of the GuSCN/CsCI method, which has added capacity and speed. 

s Briefly, for each ml of mix desired, 0.5g GuSCN are dissolved in 0.58ml of 25% LiCI (stock filtered through 
0.45 micron filter) and 20ul of mercaptoethanol is added. Cells are spun out and the pellet is dispersed on 
walls by flicking, add 1ml of solution to up to 5 x 10 7 cells. The resulting combination is sheared by 
polytron until nonviscous. For small scale preps (less than 10 s cells) layer 2mi of sheared mix on 1.5ml of 
5.7M CsCI (RNase free; 1.26 g CsCI added to every mi of 10mM EDTA pH 8), overlay with RNase-free 

10 water and spin SW55 SOkrpm 2h. For large scale preps, layer 25ml of 12rni CsCI in a SW28 tube, overlay, 
and spin 24k rpm 8h. Aspirate contents carefully with a sterile pasteur pipet connected to a vacuum flask. 
Once past the CsCI interface, scratch a band around the tube with the pipet tip to prevent the layer on the 
wall of the tube from creeping down. The remaining CsCI solution is aspirated. The pellets are taken up in 
water (do not try to redissolve). 1/10 vol. NaOAc and 3 vol. EtOH are added and the resulting combination is 

15 spun. If necessary, the pellet is resuspended in water (e.g., at 70°). Adjust concentration to 1mg/ml and 
freeze. Small RNA (e.g. 5S) does not come down. For small amounts of cells, scale down volumes and 
overlay GuSCN with RNase-free water on gradient (precipitation is inefficient when RNA is dilute). 

20 Preparation of poly A* RNA 

Next, polyA* RNA may be prepared, preferably by the oligo dT selection method. Briefly, a disposable 
polypropylene column is prepared by washing with 5M NaOH and then rinsing with RNase-free water. For 
each milligram total RNA about 0.3 ml (final packed bed) oligo dT cellulose is used. Oligo dT cellulose is 

25 prepared by resuspending about 0.5 ml of dry powder in 1 ml of 0.1 M NaOH and transferring it into the 
column, or by percolating 0.1 NaOH through a previously used column (columns can be reused many 
times). This is washed with several column volumes of RNase-free water, until pH is neutral, and rinsed with 
2-3 ml of loading buffer. The column bed is then removed into a sterile 15ml tube using 4-6 ml of loading 
buffer. The total RNA to 70 *C for 2-3 min., LiCI from RNase-free stock is added (to 0.5M), and combined 

30 with oligo dT cellulose in a 15 ml tube. This is followed by vortexing or agitation for 10 min. The result is 
poured into a column- and washed with 3 ml loading buffer and then 3 ml of middle wash buffer. mRNA is 
eluted directly into an SW55 tube with 1.5 ml of 2mM EDTA, 0.1% SDS; the first two or three drops are 
discarded. 

Eluted mRNA is precipitated by adding 1/10 vol. 3M NaOAc and filling the tube with EtOH. This is then 
35 mixed, chilled for 30 minutes at -20* C, and spun at 50k rpm at 5° C for 30 min. The EtOH is poured off and 
the tube is air dried. The mRNA pellet is resuspended in 50-1 OOul of RNase-free water. Approximately 5 ul 
is melted at 70 ' in MOPS/EDTA/formaldehyde and run on an RNase-free 1% agarose gel to check quality. 



40 cDNA Synthesis 

From this, cDNA is synthesized. A preferred method of cDNA synthesis is a variant of that described by 
Gubler and Hoffman, (Gene 25:263-269 (1982)). This is carried out as follows: 

a. First Strand. 4 ug of mRNA and heated to about 100'C in a microfuge tube for 30 seconds and 
46 quenched on ice. The volume is adjusted to 70 u I with RNase-free water. The following are added: 20 ul of 

RT1 buffer, 2 ul of RNAse inhibitor (Boehringer 36 u/ul), 1 ul of 5 ug/ul of oligo dT (Collaborative Research), 
2.5 ul of 20 mM dXTP's (ultrapure), 1 ul of 1 M DTT and 4 ul of RT-LX (Life Science, 24 u/ul). The resulting 
combination is incubated at 42" C for 40 min. It Is heated to inactivate (70* C 10 min). 

b. Second Strand. 320 ul of RNAse free water, 80 ul of RT2 buffer, 5 ul of DNA Polymerase I 
50 (Boehringer, 5 U/ui), 2 ul RNAse H (BRL 2 u/ul). Incubate at 15*C for 1 hr and 22°C for 1 hr. Add 20 ul of 

0.5M EDTA pH 8.0, phenol extract and EtOH precipitate by adding NaCi to 0.5M, linear polyacrylamide 
(carrier) to 20ug/ml, and filling tube with EtOH. Spin 2-3 minutes in microfuge, remove, vortex to dislodge 
precipitate high up on wall of tube, and respin 1 minute. 

c. Adaptors. Resuspend precipitated cDNA in 240 ul of TE (1 0/1 ). Add 30 ul of 1 0x low salt buffer, 
55 30ul of 10X low salt buffer, 30ul of 10X ligation additions, 3ul (2.4ug) of kinased 12-mer adaptor, 2ul (1.6ug) 

of kinased 8-mer adaptor, and 1 ul of T4 DNA ligase (BioLabs, 400 u/ul, or Boehringer, 1 Weiss unit/ml). 
Incubate at 15 s C overnight. Phenol extract and EtOH precipitate as above (no extra carrier now needed), 
and resuspend in 100 ul of TE. 
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Use of cDNA fragments in expression vectors 

5 

For use with the BstXI-based cDNA expression vectors of the invention, (see infra ) oligonucleotide 
segments containing terminal sequences corresponding to BstXl sites on the vectors are ligated to the 
cDNA fragment desired to be inserted. The resulting fragments are pooled by fractionation. A preferred 
method is as follows: 

io Prepare a 20% KOA, 2mM EDTA, 1 ug/ml EthBr solution and a 5% KOAc, 2mM EDTA, 1 ug/mlg EthBr 
solution. Add 2.6 ml of 20% KOAc solution to back chamber of a small gradient maker. Remove air bubble 
from tube connecting the two chambers by allowing solution to flow into the front chamber and then tilt 
back. Close passage between chambers, and add 2.5ml. of the 5% solution to the front chamber. If there is 
liquid in the tubing from a previous run, allow the 5% solution to run just to the end of the tubing, and then 

15 return to chamber. Place the apparatus on a stirplate, set the stir bar moving as fast as possible, open the 
stopcock connecting the two chambers and then open the front stopcock. Fill a polyallomer SW55 tube 
from the bottom with the KOAc solution. Overlay the gradient with 100 ul of cDNA solution. Prepare a 
balance tube and spin the gradient for 3 hrs at 50k rpm at 22 " C. To collect fractions from the gradient, 
pierce the SW55 tube with a butterfly infusion set (with the luer hub clipped off) close to the bottom of the 

20 tube and collect three 0.5ml fractions and then 6 0.25ml fractions into microfuge tubes (about 22 and 11 
drops respectively). EtOH precipitate the fractions by adding linear polyacrylamide to 20 ug/ml and filling 
the tube to the top with EtOH. After cooling tubes, spin them in a microfuge for 3 min. Vortex and respin 1 
min. Rinse pellets with 70% EtOH (respin). Do not dry to completion. Resuspend each 0.25ml fraction in 10 
ui of TE. Run 1 ui on a 1% agarose minigel. Pool the first three fractions, and those of the last six which 

25 contain no material smaller than 1 kb. 

Suppressor tRNA plasmids may be propagated by known methods. In a preferred method according to 
the present invention, supF plasmids can be selected in nonsuppressing hosts containing a second plasmid, 
p3, which contains amber mutated ampiciilin and tetracycline drug resistance elements (Seed, 1983). The 
p3 plasmid is derived from PR1, is 57kb in length, and is a stably maintained, single copy episome. The 

30 ampiciilin resistance of this plasmid reverts at a high rate, so that amp r plasmids usually cannot be used in 
p3-containing strains. Selection for tet resistance aione is almost as good as selection for ammp+tet 
resistance. However, spontaneous appearance of chromosomal suppressor tRNA mutations presents an 
unavoidable background (frequency about 10~ 9 ) in this system. Colonies arising from spontaneous suppres- 
sor mutations are usually bigger than colonies arising from plasmid transformation. Suppressor plasmids 

35 typically are selected for in LB medium containing amp at 12.5 ug/ml and tet at 7.5 ug/ml. For large plasmid 
preps, M9 casamino acids medium containing glycerol (0.8%) may be used as a carbon source, and the 
bacteria grown to saturation. 

Vector DNA may be isolated by known methods. The following method is preferred for plasmid from 1 
liter of saturated cells: 

40 Spin down cells in 1 liter J6 bottles, 4.2k rpm, 25 minutes. Resuspend in 40 mi 10mM EDTA pH 8 
(Thump on soft surface). Add 80 ml 0.2M NaOH, 1% SDS, swirl until clearish, viscous. Add 40 ml 5M 
KOAc, pH4.7 (2.5M KOAc, 2.5M HOAc) shake semi-vigorous ly (until lumps are 2-3 mm in size). Spin (same 
bottle) 4.2 rpm, 5 min. Pour supernatant through cheesecloth into 250 ml bottle. Fill bottle with isopropyl 
alcohol. Spin J6, 4.2k rpm, 5 min. Drain bottle, rinse gently with 70% EtOH (avoid fragmenting the pellet). 

45 Invert bottle, and remove traces of EtOH with Kimwipe. Resuspend in 3.5 ml Tris base/EDTA 20mM/10mM. 
Add 3.75 ml of resuspended pellet to 4.5g CsCI. Add 0.75 ml 10/mg/ml ethidium bromide, mix. Fill VTi80 
tubes with solution. Run at a speed of 80 rpm for 2.5 hours or longer. Extract bands by visible light with 1 
ml syringe and 20 gauge or lower needle. Cut top off tube, insert the needle upwards into the tube at an 
angle of about 30° with respect to the tube, (i.e., as shallowly as possible) at a position about 3mm beneath 

so the band, with the bevel of the needle up. After the band is removed, pour tube contents into bleach. 
Deposit extracted bands in 13 ml Sarstedt tube. Fill tube to top with n-butanol saturated' with 1M NaCI, 
extract If a very large quantity of DNA is obtained, reextract. Aspirate butanol into trap containing 5M NaOH 
(to destroy ethidium). Add about equal volume 1 M ammonium acetate to DNA (squirt bottle). Add about 2 
volumes 95% ethanol (squirt bottle). Spin 10K rpm, 5 min. J2-21. Rinse pellet carefully with 70% ethanol. 

55 Dry with swab, or lyophilizer. 

The vector may be prepared for cloning by known methods. A preferred method begins with cutting 20 
ug of vector in a 200 ul reaction with 100 units of BstXl (New York Biolabs), cutting at 50° C overnight in a 
weil-thermostatted water bath (i.e., circulating water bath). Prepare 2 KOAc 5-20% gradients in SW55 tubes 
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as described above. Add 100 ul of the digested vector to each tube and run for 3 hrs, 50K rpm at 22° C. 

Examine the tube under 300nm UV light. The desired band will have migrated 2/3 of the length of the tube. 

Forward trailing of the band means the gradient is overloaded. Remove the band with a 1 ml syringe and 20 

gauge needle. Add linear polyacryiamide and precipitate the plasmid by adding 3 volumes of EtOH. 
s Resuspend in 50 ul of TE. Set up ligations using a constant amount of vector and increasing amounts of 

cDNAs. On the basis of these trial ligations, set up large scale ligation, which can be accomplished by 

known methods. Usually the entire cDNA prep requires 1-2 ug of cut vector. 

Adaptors may be prepared by known methods, but it is preferred to resuspend crude adaptors at a 

concentration of 1 ug/ul, add MgSO* to 10 mM, and precipitate by adding 5 volumes of EtOH. Rinse with 
70 70% EtOH and resuspend in TE at a concentration of 1 ug/ul. To kinase take 25ul of resuspended adaptors, 

add 3ul of 10X kinasing buffer and 20 units of kinase; incubate 37* C overnight. 

Preparation of buffers mentioned in the above description of preferred methods according to the 

present invention will be evident to those of skiil. For convenience, preferred buffer compositions are as 

follows: 

15 



Loading 


0.5 M LiCI. 10mM Tris pH 7.5, 1mM EDTA0.1% 


Buffer: 


SDS. 


Middle 


0.15 M LiCI, 10mM Tris pH 7.5, 1mM EDTA 


Wash 


0.1% SDS. 


Buffer 




Rt1 Buffer: 


0.25 M Tris pH 8.8 (8.2 at 42° ), 0.25 M KCi, 30 




mM MgCl2. 


RT2 Buffer: 


0.1 M Tris pH 7.5, 25 mM MgCfe, 0.5 M KCI, 




0.25 mg/ml BSA, 50 mM DTT. 


10X Low 


60 mM Tris pH 7.5, 60 mM MgCfe, 50 mM NaCI, 


Salt 


2.5 mg/ml BSA 70 mM Me. 


10X 


1mM ATP, 20 mM DTT, 1 mg/ml BSA 10 mM 


Ligation 


spermidine. 


Additions: 




10X 


0.5 M Tris pH 7.5, 10mM ATP, 20mM DTT, 10 


Kinasing 


mM spermidine, 1 mg/ml BSA 100 mM MgCJ 2 . 


Buffer: 





35 By "vector" is meant a DNA molecule, derived from a piasmid or bacteriophage, into which fragments 
of DNA may be inserted or cioned. A vector will contain one or more unique restriction sites, and may be 
capable of autonomous replication in a defined host or vehicle organism such that the cloned sequence is 
reproducible. Thus, by "DNA expression vector" is meant any autonomous element capable of replicating in 
a host independently of the host's chromosome, after additional sequences of DNA have been incorporated 
into the autonomous element's genome. Such DNA expression vectors include bacterial plasmids and 
phages. 

Preferred for the purposes of the present invention, however, are viral vectors, such as those derived 
from simian virus strain 40 (SV40). SV40 is a papovavirus having a molecular weight of 28 Mdal, and 
containing a circular double-stranded DNA molecule having a molecular weight of 3 Mdal, which comprises 

45 the entire genome of the virus. The entire nucleotide sequence of this single, small, covalentiy closed 
circular DNA molecule has been determined. Fiers et ai., Nature 273:113-120 (1978); Reddy et aJL, Science 
200:494-502 (1978). The viral DNA of SV40 may be obtained in large quantities, and the genomic regions 
responsible for various viral functions have been accurately located with respect to a detailed physical map 
of the DNA. Fiers et ai., supra ; Reddy et aL, supra . The viral genome of SV40 can multiply vegetatively or 

so as an integral part of cellular chromosomes, and a wealth of information exists on the replication and 
expression of this genome. 

Also preferred for the purposes of the present invention is a single-stranded bacteriophage cloning 
vehicle, designated M13, having a closed circular DNA genome of approximately 6.5 kb. An advantage of 
utilizing M13 as a cloning vehicle is that the phage particles released from infected cells contain single- 

55 stranded DNA homologous to only one of the two complementary strands of the cloned DNA, which 
therefore can be used as a template for DNA sequencing analysis. 

Even more preferred for the purposes of the present invention are the expression vectors designated 
piH3, piH3M, and CDM8, deposited at the American Type Culture Collection (ATCC), 12301 Parklawn Drive, 
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Rockville, MD 20852, on February 24, 1988, in E. coli piH3 has accession number ATCC 67,634, pjH3M has 
accession number ATCC 67,633 and CDM8 has accession number ATCC 67,635. 

By "tissue culture" is meant the maintenance or growth of animal tissue cells in vitro so as to allow 
further differentiation and preservation of ceil architecture or function or both. "Primary tissue ceils" are 

5 those taken directly from a population consisting of ceils of the same kind performing the same function in 
an organism. Treating such tissue cells with the proteoiytic enzyme trypsin, for example, dissociates them 
into individual primary tissue ceils that grow weli seeded onto culture plates at high densities. Cell cultures 
arising from multiplication of primary cells in tissue culture are called "secondary cell cultures." Most 
secondary cells divide a finite number of times and then die. A few secondary cells, however, may pass 

w through this "crisis period", after which they are able to multiply indefinitely to form a continuous "cell line." 
Cell lines often will contain extra chromosomes, and usually are abnormal in other respects as well. The 
immortality of these cells is a feature shared in common with cancer cells. 

Preferred cell lines for use as tissue culture cells according to the present invention include the monkey 
kidney cell line, designated "COS." COS cells are those that have been transformed by SV40 DNA 

is containing a functional early gene region but a defective origin of viral DNA replication. COS cell clone M6 
is particularly preferred for use according to the method of the invention. Also preferred for the purposes of 
the present invention are murine "WOP" cells, which are NIH 3T3 cells transfected with polyoma origin 
deletion DNA. cDNA may be introduced into the host tissue culture cells of the present invention by any 
methods known to those of skill. Transfection may be accomplished by, for example, protoplast fusion by 

20 spheroplast fusion, or by the DEAE dextran method (Sussman et al., Cell. Biol. 4:1641-1 643 (1 984)). 

If spheroplast fusion is employed, a preferred method is the following variant based on Sandri-Goldrin 
et aU Mol. Cell Bio. 1743-752 (1981). Briefly, for example, a set of six fusions requires 100 m! of cells in 
broth. GroVceilsoontaining amplifiable piasmid to OD600 = 0.5 in LB. Add spectinomycin to 100 ug/ml (or 
chloramphenicol to 150 ug/ml). Continue incubation at 37* C with shaking for 10-16 hours. (Cells begin to 

25 lyse with prolonged incubation in spectinomycin or chloramphenicol medium). Spin down 100 ml of culture 
(JA14/GSA rotor, 250ml bottle) 5 min. at 10,000 rpm. Drain well, resuspend pellet in bottle with 5ml cold 
20% sucrose, 50mM Tris-HCL pH 8.0. Incubate on ice 5 min. Add 2 ml cold 0.25M EDTA pH 8.0, incubate 
5 min. at 37* C (waterbath). Place on ice, check percent conversion to spheroplasts by microscopy. In flow 
hood, siowly add 20ml of cold DME/10% sucrose/1 OmM MgCfe (dropwise, ca. 2 drops per second). 

30 Remove media from ceils plated the day before in 6cm dishes (50% confluent). Add 5ml of spheroplast 
suspension to each dish. Place dishes on top of tube carriers in swinging bucket centrifuge. Up to 6 dishes 
can be comfortably prepared at once. Dishes can be stacked on top of each other, but 3 in a stack is not 
advisable as the spheroplast layer on the top dish is often torn or detached after centrifugation. Spin at 
1000xg 10 min. Force is calculated on the basis of the radius to the bottom plate. Aspirate fluid from dishes 

35 carefully. Pipet 1.5-2ml 50% (w/w) PEG1450 (or PEG1000)/50% DME (no serum) into the center of the dish. 
If necessary, sweep the pipet tip around to ensure that the PEG spreads evenly and radially across the 
whole dish. After PEG has been added to the last dish, prop ail of the dishes up on their lids so that the 
PEG solution collects at the bottom. Aspirate the PEG. The thin layer of PEG that remains on the cells is 
sufficient to promote fusion: the layer remaining is easier to wash off, and better cell viability can be 

40 obtained, than if the bulk of the PEG is left behind. After 90 to 120 seconds (PEG 1000) or 120 to 150 
seconds (PEG 1450) of contact with the PEG solution, pipet 1.5ml of DME (no serum) into the center of the 
dish. The PEG layer will be swept radially by the DME. Tilt the dishes and aspirate. Repeat the DME wash. 
Add 3ml of DME/10% serum containing 15 ug/ml gentamicin sulfate. Incubate 4-6 hours in incubator. 
Remove media and remaining bacterial suspension, add more media and incubate 2-3 days. Extensive 

45 washing of the cell layer to remove PEG tends to remove many of the cells without any substantial benefit. 
If the cells are allowed to sit in the second DME wash for a few minutes, most of the spheroplast layer will 
come up spontaneously; however it is preferred to wash briefly and allow the layer to come off in the 
complete medium at 37* C. 

The PEG solution can be conveniently prepared by melting a fresh bottle of PEG at 60 * C and pouring 

so approximate 50 ml aliquots by means of a 50 ml centrifuge tube into preweighed bottles. The aliquoted 
PEG is stored at 5* C in the dark. To make up a fresh bottle, weigh the aliquot, remelt, and add an equal 
volume of DME (no serum). Adjust the pH with 7.5% Na Bicarbonate solution if necessary, and filter 
sterilize. The resulting PEG solution may be stored up to 3 months at room temperature without detectable 
adverse consequence. 

55 Transfected host cells will be cultured according to the invention in order to accomplish expression of 
the protein encoded by the cDNA clone, and to increase the absolute numbers of ceils available for 
subsequent immunoselection. Those skilled in the art will know of appropriate methods and media for this 
purpose, taking into account the cell type and other variables routinely considered, COS cells, for example, 
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may be cultured in Dulbecco's modified . Eagle's medium (DME) supplemented with 10% calf serum and 
gentamycin sulfate. Transient expression of transfected cells normally can be expected between 48 and 72 
hours posttransfection. However, this time period may vary depending upon the type or strain of host cell 
used and the cell culture conditions, as will be apparent to those of ordinary skill. 

5 Immunoprecipitation, blotting, and cDNA sequencing of genes cloned according to the methods of the 
present invention may be carried out by any convenient methods known to those of skill. For example, the 
immunoprecipitation protocol of Clark et ai., Leukocyte Typing II, Vol. II, pp. 155-167 (1986), is preferred. 
Southern, Northern, or other blot analysis methods known to"" those of skill may be employed, using 
hybridization probes prepared by known methods, such as that of Hu et aL ( Gene 18:271-277 (1982)). cDNA 

10 sequencing also may be accomplished by known methods, including the dideoxynucleotide method of 
Sanger et aL. P.N.A.S. (USA) 74:5463-5467 (1977). 

The antibodies used according to the present invention may be polyclonal or monoclonal. These may 
be used singly, or in conjunction with other polyclonal or monoclonal antibodies to effect immunoselection 
of cells expressing the desired antigen or antigens by the methods of the present invention. Methods of 

is preparing antibodies or fragments thereof for use according to the present invention are known to those of 
skill. 

Standard reference works setting forth general principles of immunology include Klein, J., Immunology: 
The Science of Self-Nonself Discrimination , John Wiley & Sons, publisher, New York (1982): Kennett, R. ( et 
al, eds., Laboratory Techniques in Biochemistry and Molecular Biology , Vol. 13, Elsevere, publisher, 
20 Amsterdam (1984). 

The term "antibody" is meant to include the intact molecule as well as fragments thereof, such as, for 
example, Fab and F(ab)' 2 fragments, which also are able to bind to antigen. Polyclonal antibody prepara- 
tions may be derived directly from the blood of the desired animal species after immunization with the 
antigen of interest, or a fragment thereof, using any of the standard protocols known to those of ordinary 
25 skill. Similarly, monoclonal antibodies may be prepared using known methods (Kohler et aL, Eur. J. 
Immunol 6:292 (1976)). Use of monoclonal antibodies is preferred for the purposes of the present invention. 

For the purposes of immunoselection according to the present invention, the tissue culture host cells 
which have been exposed to antibodies directed against the target cell surface antigen are separated from 
host cells which do not express the target antigen by distributing the cells onto a substrate coated with 
30 antibody directed against the antibody for the antigen. This technique, termed "panning," will be known to 
those of skill, and is described, for example, by Mage et aL, Immunol. Meth. 15:47-56 (1977), and 
Wysocki and Sato, P.N.A.S. (USA) 75:2844-2848 (1978). 

Panning according to the methods of the present invention may be carried out as follows: 
a. Antibody-coated dishes. Bacteriological 60mm plates, Falcon 1007 or equivalent, or 10cm dishes 
35 such as Fisher 8-757-12 may be used. Sheep anti-mouse affinity purified antibody (from, for example, 
Cooper BioMedical (Cappell)) is diluted to 10ug/ml in 50mM Tris HCI, pH 9.5. Add 3ml per 6cm dish, or 
10ml per 10cm dish. Let sit ca. 1.5 hrs., remove to next dish 1.5 hrs., then to 3rd dish. Wash plates 3x with 
0.15 NaCI (a wash bottle is convenient for this), incubate with 3ml 1mg/ml BSA in PBS overnight, aspirate 
and freeze. 

40 b. Panning. Cells will be in 60mm dishes. Aspirate medium from dish, add 2ml PBS/0.5mM 

EDTA/0.02% azide and incubate dishes at 37* C for 30 min. to detach cells from dish. Triturate cells 
vigorously with short pasteur pipet, and collect cells from each dish in a centrifuge tube. Spin 4 min. setting 
2.5 (200 x g) (takes 5 min). Resuspend cells in 0.5 -1.) ml PB S/E DTA/azi d e/5% FBS and add antibodies. 
Incubate at least 30 min. on ice. Add an equal volume of P BS/EDTA/azide , layer carefully on 3 ml 

45 P BS/EDTA/azi d e/2% Ficoll, and spin 4 min. at setting 2.5. Aspirate supernatant in one smooth movement. 
Take up cells in 0.5ml P BS/EDTA/azide and add aliquots to antibody-coated dishes containing 3ml 
PBS/EDTA/azida/5% FBS by pipetting through 100 micron Nylon mesh (Tetko). Add cells from at most two 
60mm dishes to one 60mm antibody-coated plate. Let sit at room temperature 1-3 hours. Remove excess 
cells not adhering to dish by gentle washing with PBS/5% serum or with medium. 2 or 3 washes of 3ml are 

so usually sufficient. 

c. Hirt Supernatant. A preferred variant of the method of Hirt, J. Molec. Biol . 26:365-369 (1967), is as 
follows: Add 0.4 ml 0.6% SDS, 10mM EDTA to panned plate. Let sit 20 minutes (can be as little as 1 min. if 
there are practically no cells on the plate). Pipet viscous mixture into microfuge tube. Add 0.1 mi 5M NaC!, 
mix, put on ice at least 5 hrs. Keeping the mixture as cold as possible seems to improve the quality of the 
55 Hirt. Spin 4 min., remove supernatant carefully, phenol extract (twice if the first interface is not clean), add 
10ug linear poly aery lam ide (or other carrier), fill tube to top with EtOH, precipitate, and resuspend in 0.1 mi. 
Add 3 volumes EtOH/NaOAc, reprecipitate and resuspend in 0.1 ml. Transform into MCl061/p3, preferably 
using the high efficiency protocol hereinafter described. If the DNA volume exceeds 2% of the competent 
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cell aliquot, the transformation efficiency will suffer. 5% gives the same number of colonies as 2.5% 
(efficiency is halved). 

It is preferred for this aspect of the present invention to use "blockers" in the incubation medium. 

5 Blockers assure that non-specific proteins, proteases, or antibodies present do not cross-link with or destroy 
the antibodies present on the substrate or on the host cell surface, to yield false positive or false negative 
results. Selection of blockers can substantially improve the specificity of the immunoseiection step of the 
present invention. A number of non-specific monoclonal antibodies, for example, of the same class or 
subclass (isotype) as those used in the immunoseiection step (e.g., IgQi , lgG2A, IgGm, etc.) can be used 

10 as blockers. Blocker concentration (normally 1-100ug/ul) is important to maintain the proper sensitivity yet 
inhibit unwanted interference. Those of skill also will recognize that the buffer system used for incubation 
may be selected to optimize blocking action and decrease non-specific binding. 

A population of cells to be panned for those expressing the target cell surface antigen is first detached 
from its ceil culture dish (harvested) without trypsin. The cells then are exposed to a first antibody, which 

rs may be polyclonal or monoclonal, directed against the antigen of interest or against a family of related 
antigens. At this initial stage, a single antibody or a group of antibodies may be used, the choice depending 
upon the nature of the target antigen, its anticipated frequency, and other variables that will be apparent to 
those of skill. Target antigens expressed on the surfaces of host cells will form an antigen-antibody 
complex. 

20 The cells subsequently are placed in close apposition to a substrate, such as a culture dish, filter disc, 
or the like, which previously has been coated with a second antibody or group of antibodies. This second 
antibody will be directed against the first antibody, and its choice will be a matter of ordinary skill dictated 
by, for example, the animal in which the first antibody was raised. For example, if the first antibody was 
raised in mice, the second antibody might be directed against mouse immunoglobulins, raised in goats or 

25 sheep. Cells expressing the target antigen will adhere to the substrate via the complex formed between the 
antigen, the first antibody, and the second antibody. Adherent cells then may be separated from 
nonadherent cells by washing. DNA encoding the target antigen is prepared from adherent cells by known 
methods, such as that of Hirt, J. Molec. Biol. 26:365-369 (1967). This DNA may be transformed into E coli 
or other suitable host ceils for further rounds of fusion and selection, to achieve the desired degree of 

30 enrichment. 

In the usual case, the initial rounds of immunoseiection will employ a panel of first antibodies directed 
against an epitope or group of epitopes common to the family of antigens to which the target antigen 
belongs. This will be sufficient to narrow the number of clones for future rounds quite significantly. Two 
such rounds usually will be found adequate, but the number of rounds may vary as mentioned above. 

35 Thereafter, a single round of selection may be performed employing a single first antibody or a group of 
first antibodies recognizing only the target antigen. 

By "substrate" is meant a solid surface to which antibodies may be bound for immunoseiection 
according to the present invention. Known suitable substrates include glass, polystyrene, polypropylene, 
dextran, nylon, and other materials. Tubes, beads, microtiter plates, bacteriological culture dishes, and the 

40 like formed from or coated with such materials may be used. Antibodies may be covalently or physically 
bound to the substrate by known techniques, such as covalent bonding via an amide or ester linkage, or by 
absorption. Those skilled in the art will know many other suitable substrates and methods for immobilizing 
antibodies thereupon, or will be able to ascertain such substrates and methods using no more than routine 
experimentation. 

45 The choice of host tissue culture cells for use according to the present invention preferably should be 
such as to avoid the situation in which the antibodies used for panning recognize determinants on 
untransfected cells. Thus, while COS cells are preferred for transient expression of certain surface antigens, 
more preferred are murine WOP cells. Of the latter, WOP 3027 cells are even more preferred. WOP cells 
allow virtually all antibodies to be used, since cross-reactions between murine antibodies and murine cell 

so surface determinants are rare. 

The insert size of the recombinant DNA molecule should be chosen to maximize the likelihood of 
obtaining an entire coding sequence. Those of skill will know various methods by which a preliminary 
determination of optimal insert size for a given gene may be determined. 

55 

Vector construction and cDNA insertion 

Vectors suitable for expression of cDNA in mammalian tissue culture cells may be constructed by 
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known methods. Preferred for the purposes of the present invention is an expression vector containing the 
SV40 origin. The vector may contain a naturally derived or synthetic transcription origin, and the SV40 early 
region promoter. Even more preferred is a chimeric promoter composed of human cytomegalovirus 
immediate early enhancer sequences. Various "enhancer sequences" also may be used with SV40 vectors. 

5 These are described, for example, by Banerji et aL, Cell 27:299-308 (1981); Levinson et aL, Nature 295:568- 
572 (1 982); and Conrad et aL, MoL CelL BjoL 2:949-965 (1 982). 

Insertion of cDNA into the vectors of the present invention can occur, Jot example, by homopoiymeric 
tailing with terminal transferase. However, homopoiymeric tracts located 5' to cDNA inserts may inhibit in 
vitro and in vivo expression. Thus, preferred for purposes of the present invention is the use of inverted 

10 identical cleavage sites separated by a short replaceable DNA segment. Such inverted identical cleavage 
sites, preferably employing the BstXl restriction endonuclease, may be used in parallel with cDNA synthetic 
oligonucleotides, giving the same terminii as the replaceable segment of the vector. In this manner, the 
cDNA cannot ligate to itself, but can ligate to the vector. This allows the most efficient use of both cDNA 
and vector. 

15 Another embodiment of the present invention is the above-described efficient oligonucleotide-based 
strategy to promote cDNA insertion into the vector. The piH3M vector of the present invention is preferred, 
and employs the inverted endonuclease sites. This vector may contain an SV40 origin of replication, but a 
more preferred form contains an M13 origin. This vector, containing the M13 origin, allows high level 
expression in COS cells of coding sequences placed under its control. Also, the small size and particular 

20 arrangement of sequences in the plasmid permit high level replication in COS cells. 

By "cell surface antigen" is meant any protein that is transported through the intracellular membrane 
system to the cell surface. Such antigens normally are anchored to the cell surface membrane through a 
carboxyl terminal domain containing hydrophobic amino acids that lie in the lipid bilayer of the membrane, 
and there exert their biological and antigenic effects. Antigens such as those of T-lymphocytes are 

25 particularly suited for gene cloning by the method of the present invention. However, ceil surface antigens 
of any cells may be cloned according to the present method. Moreover, proteins not normally expressed on 
the cell surface may admit of cloning according to the present method by, for example, using fluorescence 
activated cell sorting (FACS) to enrich for fixed cells expressing intracellular antigens. 

By "substantially pure" is meant any antigen of the present invention, or any gene encoding any such 

30 antigen, which is essentially free Of other antigens or genes, respectively, or of other contaminants with 
which it might normally be found in nature, and as such exists in a form not found in nature. By "functional 
derivative" is meant the "fragments," "variants," "analogs," or "chemical derivatives" of a molecule. A 
"fragment" of a molecule, such as any of the antigens of the present invention, is meant to refer to any 
polypeptide subset of the molecule. A "variant" of such molecules is meant to refer to a naturally occurring 

35 molecule substantially similar to either the entire molecule, or a fragment thereof. An "analog" of a 
molecule is meant to refer to a non-natural molecule substantially similar to either the entire molecule or a 
fragment thereof. 

A molecule is said to be "substantially similar" to another molecule if the sequence of amino acids in 
both molecules is substantially the same, and if both molecules possess a similar biological activity. Thus, 

40 provided that two molecules possess a similar activity, they are considered variants as that term is used 
herein even if one of the molecules contains additional amino acid residues not found in the other, or if the 
sequence of amino acid residues is not identical. As used herein, a molecule is said to be a "chemical 
derivative" of another molecule when it contains additional chemical moieties not normally a part of the 
molecule. Such moieties may improve the molecule "s solubility, absorption, biological half life, etc. The 

45 moieties may alternatively decrease the toxicity of the molecule, eliminate or attenuate any undesirable side 
effects of the molecule, etc. Moieties capable of mediating such effects are disclosed, for example, in 
Remington's Pharmaceutical Sciences , 16th ed., Mack Publishing Co., Easton, Penn. (1980). 

Similarly, a "functional derivative" of a gene of any of the antigens of the present invention is meant to 
include "fragments," "variants," or "analogues" of the gene, which may be "substantially similar" in 

50 nucleotide sequence, and which encode a molecule possessing similar activity. 

The substantially pure antigens that have been expressed by methods of the present invention may be 
used in immunodiagnostic assay methods well known to those of skill, including radio-immunoassays 
(RIAs), enzyme immunoassays (ElAs) and enzyme-linked immunosorbent assays (ELISAs). The substan- 
tially pure proteins of the present invention, in soluble form, may be administered alone or in combination 

55 with other antigens of the present invention, or with other agents, including lymphokines and monokines or 
drugs, for the treatment of immune-related diseases and disorders in animals, including humans. As 
examples of such disorders that may benefit from treatment with the substantially pure proteins of the 
present invention may be mentioned immune deficiency diseases, diseases of immediate type hypersen- 
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sitivity, asthma, hypersensitivity pneumonitis, immune-complex disease, vasculitis, systemic lupus 
erythematosus, rheumatoid arthritis, immunopathogenic renal injury, acute and chronic inflammation, 
hemolytic anemias, platelet disorders, plasma and other cell neoplasms, amyloidosis, parasitic diseases, 
multiple sclerosis, Guiliain-Barre syndrome, acute and subacute myopathic paralysis, myasthenia gravis, 

s immune endocrinopathies, and tissue and organ transplant rejection, all as described in Petersdorf et aL, 
eds., Harrison's Principles of Internal Medicine , supra . See also Weir, ed., supra ; Boguslaski et aL, eds. f 
supra ; and Holborow et all eds., supra . 

When used for immunotherapy, the antigens of the present invention may be unlabeled or labeled with 
a therapeutic agent. Examples of therapeutic agents which can be coupled to the antigens of the invention 

iq for immunotherapy are drugs, radioisotopes, lectins, and toxins. 

The dose ranges for the administration of the antigens of the present invention are those large enough 
to produce the desired immunotherapeutic effect, but not so large as to cause adverse side effects, such as 
unwanted cross-reactions, anaphylactic reactions, and the like. Generally, the dosage employed will vary 
with the age, condition, sex, and extent of the disease in the patient Counterindications (if any), immune 

is tolerance and other variables also will affect the proper dosage. Administration may be parenteral, by 
injection or by gradual perfusion over time. Administration also may be intravenous, intraparenteral, 
intramuscular, subcutaneous, or intradermal. 

Preparations for parenteral administration include sterile or aqueous or non-aqueous solutions, suspen- 
sions and emulsions. Examples of non-aqueous solvents include propylene glycol, polyethylene glycol, 

20 vegetable oils such as olive oil, and injectable organic esters such as ethyl oleate. Aqueous carriers include 
water, alcoholic and aqueous solutions, emulsions, or suspensions, including saline and buffered media. 
Parenteral vehicles include sodium chloride solution, Ringer's dextrose, dextrose and sodium chloride, 
lactated Ringer's, or fixed oils. Intravenous vehicles include fluid and nutrient replenishers, electrolyte 
replenishes, such as those based on Ringer's dextrose, and the like. Preservatives and other additives also 

25 may be present, such as, for example, antimicrobials, antioxidants, chelating agents, inert gases and the 
like. Such preparations, and the manner and method of making them, are known and described, for 
example, in Remington's Pharmaceutical Science , 16th ed., supra . 

The antigens of the present invention also may be prepared as medicaments or pharmaceutical 
compositions comprising the antigens, either alone or in combination with other antigens or other agents 

30 such as lymphokines, monokines, and drugs, the medicaments being used for therapy of animal, including 
human, immune-related indications. 

Although the antigens of the present invention may be administered alone, it is preferred that they be 
administered as a pharmaceutical composition. The compositions of the present invention comprise at least 
one antigen or its pharmaceutical^ acceptable salt, together with one or more acceptable carriers and 

35 optionally other therapeutic agents. By "acceptable" is meant that the agent or carrier be compatible with 
other ingredients of the composition and not injurious to the patient. Compositions include those suitable for 
oral, rectal, nasal, topical (including buccal and sublingual), vaginal, or parenteral administration. The 
compositions conveniently may be presented in unit dosage form, and may be prepared by methods well 
known in the pharmaceutical arts. Such methods include bringing into association the active ingredients with 

40 the carrier which constitutes one or more accessory ingredients. In general, compositions are prepared by 
uniformly and intimately bringing into association the active ingredient with liquid carriers or finely divided 
solid carriers, or both, and shaping the product formed thereby, if required. 

Orally administered pharmaceutical compositions according to the present invention may be in any 
convenient form, including capsules, cachets, or tablets, each containing a predetermined amount of the 

45 active ingredient. Powders or granules also are possible, as well as solution or suspension in aqueous or 
nonaqueous liquids, or oil-in-water liquid emulsions, or water-in-oil liquid emulsions. The active ingredient 
also may be presented as a bolus, electuary or paste. 

Having now described the invention, the same will be more fully understood by reference to the 
following examples, which are not intended in any way to limit the scope of the invention. 

50 

EXAMPLE I Isolation, Molecular Cloning, and Structure of the Human CD2 Antigen 



55 Tne cDNA expression vector piH3 

A COS cell expression vector was constructed from piSV (Little et al., MoL Biol. Med. 1^:473-488 (1983)) 
by inserting a synthetic transcription unit between the suppressor tRNA gene and the SV40 origin. The 
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transcription unit consisted of a chimeric promoter composed of human cytomegalovirus AD169 imme- 
diately eariy enhancer sequences fused to the HIV LTR -67 to +80 sequences. Immediately downstream 
from the LTR +80 sequence was inserted a polylinker containing two BstXI sites separated by a 350bp 
stuffer; the BstXI sites were flanked by Xbal sites, which could also be used to excise the insert, 
s Downstream from the polylinker were placed the SV40 small t antigen splice and early region polyadenyla- 
tion signals derived from pSV2. The nucleotide sequence of the vector is shown in Figure 1 . 



cDIMA library construction 

10 

RNA was prepared from HPB-ALL cells by the guanidinium thiocyanate/CsCI method, as described 
above. PolyA* RNA was prepared from total RNA by oligo dT selection. Maniatis et al, Molecular Cloning: A 
Laboratory Manual , supra . cDNA was synthesized by the method of Gubler and Hoffman, ( Gene 25:263-269 
(1982)). BstXI adaptors were ligated to the cDNA, and the reaction products fractionated by centrifugation 
15 through a 5ml-20% potassium acetate gradient containing ImM EDTA for 3 hours at 50krpm in a SW55 
rotor. 0.5ml fractions were collected manually through a syringe needle or butterfly inserted just above the 
curve of the tube. Individual fractions were ethanoi-precipitated after addition of linear polyacrylamide 
(Strauss and Varshavsky, Cel[ 37:889-901 (1984)) to 20 ug/ml. Fractions containing cDNA larger than 700bp 
were pooled and ligated to gradient purified BstXI digested piH3 vector. 
20 The ligated DNA was transformed into E. coli MC1061/p3 made competent by the following protocol: 
The desired strain was streaked out on an LB plate. The next day a single colony was inoculated into 20ml 
TYM broth (recipes below) in a 250ml flask. The cells were grown to midlog phase (ODgoo about 0.2-0.8), 
poured into a 21 flask containing 100ml TYM, and vigorously agitated until cells grew to 0.5-079 OD, then 
diluted again to 500ml in the same vessel. When the ceils grew to OD 6 oo 0.6, the flask was placed in ice- 
as water, and shaken gently to assure rapid cooling. When the culture was cool, it was spun at 4.2k rpm for 15 
minutes (J6). The supernatant was poured off and the pellet resuspended in about 100ml cold TfB I (below) 
by gentle shaking on ice. Thereafter, it was respun in the same bottle at 4.2krpm for 8 minutes (J6). The 
supernatant was poured off and the pellet resuspended in 20 ml cold TfB II by gentle shaking on ice. 0.1 to 
0.5 ml aliquots were placed in prechilled microfuge tubes, frozen in liquid nitrogen, and stored at -70 a C. 
30 For transformation, an aliquot was removed, thawed at room temperature until just melting, and placed on 
ice. DNA was added, let sit on ice 15-30 minutes, and incubated at 37° C for 5 minutes (6 minutes for 0.5mi 
aliquots). Thereafter the DNA-containing suspensions were diluted 1:10 in LB and grown for 90 minutes 
before plating or applying antibiotic selection. Alternatively, the heat-pulsed transformation mix was plated 
directly on antibiotic plates onto which a thin (4-5ml) layer of antibiotic-free LB agar was poured just before 
35 plating. 

Media and Buffers: TYM: 2% Bacto-Tryptone, 0.5% Yeast Extract. 0.1 M NaCI, 10mM MgsCH (can be 
added before autoclaving). TfB I: 30mM KOAc, 50mM MnCI 2 , 100mM KCL, 10mM CaCI 2l 15% (v/v) 
glycerol. TfB II: 10mM Na-MOPS, pH 7.0, 75 MM CaCI 2 , lOmM KCI, 15% glycerol. 

40 

Recovery of cDNA clones by panning 

Bacteriological culture dishes (Falcon 1007) were prepared for panning by coating with an affinity 
purified sheep anti-mouse IgG antibody as described by Wysocki and Sato ( Proc. Natl. Acad. Sci. USA 

45 75:2844-2848 (1978)), except that dishes were washed with 0.1 5M NaCI from a wash bottle instead of PBS, 
and unreacted sites were blocked by overnight incubation in PBS containing 1mg/ml BSA. Dishes were 
typically prepared in large batches and stored frozen, after aspiration of the PBS/BSA. In the first round of 
screening, 24 6cm dishes of 50% confluent COS cells were transfected by protoplast fusion according to 
the method of Sandri-Goldrin et al., MoJL Cell. BioL 1_:743-752 (1981). 72 hours post fusion the ceils were 

so detached by incubation in PBS/1 mM EDTA/.02% sodium azide at 37° C for 30 minutes. The detached cells 
were pooled, centrifuged, and resuspended in cold PBS/EDTA/5% Fetal Bovine Serum containing mon- 
oclonal antibodies, usually as ascites at 1:1000 dilution, but also as commercial reagents at the concentra- 
tions suggested by the manufactures. After 1 hour on ice, the cells were diluted with 1:1 with 
PBS/EDTA/azide and layered on 10ml of PBS/EDTA/azide containing 2% Ficoll 400. After centrifugation 

55 (400xg, 5 minutes), the supernatant was carefully aspirated, the pellet resuspended in a small amount of 
PBS/EDTA/5% FBS, and the cells distributed into panning plates containing 3 ml of PBS/EDTA/5% FBS. 
The plates were then treated essentially as described by Wysocki and Sato. Proc. Natl. Acad. Sci. USA 
75:2844-2848 (1978). Episomal DNA was recovered from the adherent cells by the Hirt (J. MoL Biol. 26:365- 
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269 (1967)) procedure and transformed into MC1061/p3. 



Cell lines and eel! culture 

5 

COS cell clone M6 cells were propagated in Dulbecco's modified Eagle's medium supplemented with 
10% calf serum and gentamycin sulfate at 15ug/ml (DME/10% calf serum). Cells were split the day before 
transfection in 6 cm dishes at approximately 1 :8 ratio from stock plates kept as dense as possible without 
overtly affronting the cells. T cell lines were grown in Iscove's modification of Dulbecco's medium (IMDM) 
10 containing gentamycin as above, and either NuSerum (Collaborative Research) or fetal bovine serum at 
10%. 



COS cell transfection for immunofluorescence studies 

15 ~~ 

COS cells at 50% confluence in 6 cm dishes were transfected in a volume of 1.5ml with a cocktail 
consisting of DME or IMDM medium containing 10% NuSerum (Collaborative Research), 400ug/ml DEAE 
Dextran, 10uM chloroquine diphosphate, and 1ug/mg DNA. After 4 hours at 37 "C (or earlier if the cells 
appeared ill), the transfection mix was removed and the cells were treated with 10% DMSO in PBS for 2 

20 minutes. Sussman and Milman, Cell Biol. 4:1641-1643 (1984), Cells were then returned to DME/10% calf 
serum for 48 to 72 hours to allow expression. 



Immunoprecipitations, Northerns and Southerns 

25 

T cells were labeled by lactoperoxidase treatment, lysed, and immunoprecipitated by the procedure of 
Clark and Einfeld ( Leukocyte Typing II, Vol. II, pp. 155-167 (1986)), using commercially available goat anti- 
mouse IgG agarose beads (Cooper Biomedical). COS cells were transfected with DEAE Dextran method 
and trypsinized and passed without dilution into new plates 24 hours after transfection. 36 hours later, cells 

30 were detached by exposure to PBS/EDTA as above, centrifuged, and labeled by the lactoperoxidate 
method. A cleared lysate was prepared as for the T cell immunoprecipitations, except that the lysis buffer 
contained 1mM PMSF, and incubation with the primary antibody was carried out for only 2 hours at 4"C. 
Eluted samples were fractionated on discontinuous 11.25% polyacrylamide gels using the buffer system of 
Laemmii ( Nature 227 :680-685 (1970)). 

35 Northern blot analysis was carried out essentially as described (Maniatis et al., Molecular Cloning, a 
Laboratory Manual (1982)), except that DMSO was omitted from the loading buffer, denaturation was at 
70 C for 5 minutes, and the gel contained 0.6% formaldehyde rather than 6%. The gel was stained in two 
volumes of water containing 1 ug/ml ethidium bromide, photographed, and transferred to nylon (GeneScreen, 
DuPont) in the staining liquor. The transferred RNA was irradiated by exposure to a germicidal lamp through 

40 Saran Wrap (Church and Gilbert, Proc. Natl. Acad. Sci. USA 8:1991-1995 (1984)) for 5 minutes at a flux 
(measured at 254nm) of 0,22m W/cm 2 . Southern blot analysis was carried out by alkaline transfer to nylon 
(GeneScreen, DuPont) as described by Reed and Mann ( Nuci. Acids Res. 13:7207-7221 (1986)). Hybridiza- 
tion probes were prepared by the method of Hu and Messing ( Gene 18:271-277 (1982)), and blots were 
prehybridized in SDS/phosphate buffer (Church and Gilbert, Proc. Natl. Acad. Sci. USA 8:1991-1995 (1984)) 

45 containing 10 DNA microgram equivalents of M13 mp19 phage. 



Erythrocyte Rosetting 

50 Erythrocytes were prepared from whole blood by three centrifugations in PBS. COS ceils were 
transfected in 6 cm dishes with CD2 or other surface antigen expression clones by the DEAE method. 48 to 
72 hours posttransfection, the medium was aspirated and 2ml of PBS/5% FDS/azide was added to each 
plate, followed by 0.4 ml of the appropriate erythrocyte samples as 20% suspensions in PBS. After 1 hour 
at room temperature, the nonadherent erythrocytes were gently washed off, and the plates examined. 

55 A cDNA encoding CD2 antigen determinants was isolated in the following manner: cDNA was prepared 
from RNA extracted from the human T Cell tumor line HPB-ALL and inserted into the SV40 origin-based 
expression vector piH3 as described above. A cDNA library of approximately 3 x 10 s recombinants was 
constructed, and the library was introduced into COS cells by protoplast fusion. Three days later the cells 
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were detached by exposure to EDTA and treated with a pool of monoclonal antibodies, including three 
(OKT11, Leu5b, and Coulter T11) directed against CD2 determinants. The antibody-treated cells were 
distributed into dishes coated with an affinity purified sheet anti-mouse IgG antibody, allowed to attach, and 
separated from the nonadherent cells by gentle washing. This method of enrichment is known in the 

5 immunological literature {Mage et a}., <L Immunol. Methods 15:47-56 (1 977). 

The resulting colonies were pooled, fused into COS cells, and subjected to a second round of panning 
as before. In the third round, a portion of the detached cells was treated with a mixture of three monoclonal 
antibodies specific for CD2, and a Hirt supernatant was again generated and transformed into E. coli. DNA 
was prepared from eight of the resulting colonies and transfected into COS cells. After three days, surface 

10 expression of the CD2 antigen was detected by indirect immunofluorescent in six of eight transfected 
dishes. Restriction enzyme digestion of the corresponding plasmid DNAs revealed a 1.5kb insert in all six 
isolates. 

One of the six clones was prepared in larger quantities for further analysis. Following transfection into 
COS cells, indirect immunofluorescence analysis with a partial panel of antibodies provided by the Third 
is International Workshop on Leukocyte Differentiation Antigens showed that ail of the antibodies provided 
gave positive reactions with the exception of one sample which also failed to react with phytohemagglutinin- 
activated T lymphocytes. Among the 17 antibodies tested were at least eight distinguishable groups defined 
by their differing patterns of reactivity with lymphocytes of various primate species. Jonker and Nooij, 
Leukocyte Typing II, Vol. I, pp. 373-387 (1986). 

20 

cDNA sequence analysis 

The CD2 cDNA insert was subcloned into M13 mpl9 (Vieira and Messing, Gene 19: 259-268 (1982)) in 

25 both orientations, and the sequence determined by the dideoxynucleotide method (Figure 2). Sanger et ai., 
Proc. Natl. Acad. Sci. USA 74: 5464-5467 (1977). An open reading frame was observed to extencT3*60 
reildueTfrom an ATG triplet satisfying the consensus criteria of Kozak (Microbiol. Rev. : 1-47:45 (1983)) for 
translational initiation codons (Figure 1). The predicted amino acid sequence evokes an integral membrane 
protein with a single membrane spanning hydrophobic anchor terminating in a rather large intracytoplasmic 

30 domain. Comparison of the N-terminal amino sequence with the matrix of signal sequence residue 
frequencies constructed by von Heijne ( Nucl. Acids Res. 14:4683-4690 (1986)) suggests that mature CD2 
peptide is formed by cleavage of a precursor peptide between the 19th (Ser) and 20th (Lys) residues. 

A surprising and unexpected feature of this sequence is the presence of a potential N-Jinked 
glycosylation site just proximal to the proposed cleavage site. The resulting polypeptide backbone has a 

35 predicted molecular weight of 38.9kd divided into an external domain of mass 21 .9kb and a cytoplasmic 
domain of mass 1 .6kd. Three N-linked glycosylation sites are present in the extra-cellular domain. 

The membrane spanning domain comprises 26 unchanged residues of predominantly hydrophobic 
character. In the nine residues immediately following are seven basic residues, either lysines or arginines. 
The appearance of predominantly hydrophobic residues followed by basic residues is a common organiza- 

40 tiona! feature of transmembrane proteins bearing carboxyl-terminal anchors. 

Another surprising feature of the transmembrane domain is the appearance of a cys-giy-gly-gly, a beta 
turn motif (Chou and Fasman, Annual Review of Biochemistry , 47:251-276 (1978)), flanked by hydrophobic 
residues (which are frequently found flanking beta turns). Because only 20 residues arrayed in an alpha 
helix are theoretically needed to traverse the 3nm membrane bilayer (Tanford, Science , 200:1012-1018 

45 (1978)), and as few as 14 hydrophobic residues can allow insertion and export of an integral membrane 
protein (Adams and Rose, Ceil, 41:1007-1015 (1985)), the transmembrane segment of the CD2 antigen may 
contain a bend or king. 

The rather large size of the cytoplasmic domain leaves open the possibility that CD2 possesses an 
intrinsic enzymatic activity. The cytoplasmic domain is very rich in prolines and contains three sites with 
50 high turn probability. 

Comparison of the amino acid sequence with the NBRF database revealed no substantive homologies 
with other proteins. In particular, no homology with the T cell receptor alpha or beta chains was observed, 
ruling out the suggestion that CD2 is a primordial T cell receptor. Milanese et al., Science , 231:1118-1122 
(1986). 

55 Just inside the cytoplasmic face of the protein is a run of basic proteins followed by a serine residue, a 
pattern found at the same location in both the EGF receptor and the class I histocompatibility genes, and in 
each case a known site for either in vivo (EGF) and in vitro (HLA) phosphorylation by protein kinase C or 
cyclic AMP-dependent protein kinase, respectively. Hunter et aL, Nature , 311:480-482 (1984); David and 
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Czech (1985); Guild and Strominger (1984). A similar site is found in the intracytoplasmic domain of the 
interieukin 2 receptor, and is phosphoryiated in vivo by protein kinase C. Leonard et aL, Nature , 311:626- 
631 (1984); Nikaido et aL, Nature , 31^:631-635 (1984); Shackelford and Trowbridge, J. BioL Chem. , 
259:11706- (1984). 

5 

Immunoprecipitation of CP2 antigen expressed by transfected cells 

COS cells were transfected with the CD2 expression piasmid and surface labeled with 125| by the 
10 lactoperoxidase method 60 hours post-transfection. A cell lysate was prepared, and portions were incubated 
either with monoclonal anti-CD2 antibody (OKT11) or with an extraneous (OKT4; anti-CD4) antibody for 2 
hours at 4*C. Sepharose-bound anti-mouse antibody was added, and after several washing steps, the 
adsorbed proteins were eluted and electrophoresed through a 1 1 .25% acrylamide gel together with similarly 
prepared immunoprecipitates from phytohemaggiutinin-activated T lymphocytes, the cDNA donor line HPB- 
15 ALL, or a long-term T cell line generated in this laboratory. Autoradiography demonstrated a prominent 
band of immunoreactive material precipitated from transfected COS cells by the anti-CD2 antibody, but not 
by the control. The calculated mean molecular weight of the COS cell material was 51 kd, compared to a 
mean molecular weight of 54kd for the T blast and T cell line material; the antigen from HPB-ALL cells was 
found to have a molecular weight of approximately 61 kd. The observed differences in size were attributed to 
20 different patterns of glycosyiation in the different ceil types. A minor band of apparent molecular weight 
38kd was present in material immunoprecipitated from COS cells but not from T cells or HPB-ALL cells. 
The size of this species agrees within experimental error with the predicted molecular weight of mature 
unglycosylated peptide, 39kd. 

25 

COS cells expressing CD2 form rosettes with sheep erythrocytes 

COS cells transfected with the CD2 expression clone were treated for 1 hour with purified MT910 (IgG, 
kappa) anti-CD2 antibody (Rieber et ah, Leukocyte Typing ii, Vol. I, pp. 233-242 (1986)) at a concentration 

30 of 1ug/ml, or with purified MG40.5 (IgGl, kappa; Kawata et aL, J. Exp. Med. , 160:633-651 (1984)) antibody at 
the same concentration. MB40.5 recognizes a monomorphic H LA- ABC determinant and cross-reacts with 
African Green Monkey histocompatibility antigens; it was chosen because it represents an isotyps-matched 
antibody recognizing a surface antigen of approximately the same abundance as the CD2 antigen 
expressed by transfected cells. Sheep erythrocyte rosettes were observed in the presence of MF40.5, but 

35 not of MT910. Rosette inhibition was also observed with OKT11 antibody, and not with various other control 
antibodies. 



Transfected COS cells form rosettes with other animal erythrocytes 

40 ~ ~~ ~~~ 

In addition to sheep erythrocytes, human T cells are known to form rosettes with horse, pig, dog, goat, 
and rabbit, but not mouse or rat erythrocytes. Johansen et aL, J. Allergy Clin. Immunol. , 54:86-94 (1974); 
Amiot et aL in, A. Bernard et aL, eds., Leucocyte Typing , Springer, publisher, New York, N.Y., pp. 281-293 
(1984); Nalet and Fournier, CelL Immunol , 96:126-136 (1985). Autorosettes between human erythrocytes 

45 and human thymocytes (Baxley et aL, Clin. Exp. Immunol. , 15:385-393 (1973)) have also been reported. 
COS cells transfected with the CD2 expression clone were treated with either MT910 or with the control 
antibody, MB40.5, and exposed to erythrocytes from the species above. Rosettes were observed with 
horse, pig, dog, goat, sheep, rabbit, and human erythrocytes, but not with mouse or rat erythrocytes. 
Rosette formation was blocked by pretreatment of transfected COS cells with MT910, but not with MB40.5. 

so In these experiments, it was noticed that horse erythrocytes formed unusually dense rosettes, and that goat 
erythrocytes formed rather sparse rosettes, possibly because their small size made them more susceptible 
to washing. Mouse erythrocytes showed weak spontaneous binding to the culture dish as well as to MT910 
and MB40.5 pretreated cells, while rat erythrocytes showed no detectable binding of any sort. 

55 

Binding of human erythrocytes is blocked by LFA3 antibody 

Because it has been suggested on the basis of antibody blocking studies that LFA3 is the target 
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structure for the CD2 antigen (Shaw et al., Nature , 323:262-264 (1986)), the ability of anti-LFA3 antibody to 
prevent rosette formation was investigated. Transfected ceils were exposed to human erythrocytes pretreat- 
ed for 2 hours with either anti-LFA3 (IgGI, kappa) as ascites at 1:1000 dilution, or with a 10ug/ml 
concentration of each of four isotype-matched nonagglutinating antibodies directed against human eryth- 
5 rocyte antigens as prevalent or more prevalent than LFA3:G10/B11 and D10, anti-K14 antigen, D6, anti-Wr b 
antigen; and F7/B9, anti-k antigen. Nichols et al., Vox Sang , in press. The erythrocytes were washed free of 
excess LFA3 antibody, but were allowed toTorm rosettes in the presence of the control antibodies to guard 
against possible loss of antibody blocking power by desorption. Rosette formation was observed in the 
presence of all four control antibodies, but not with erythrocytes pretreated with anti-LFA3. 

w 

COS cells expressing other T cell antigens do not form rosettes 

A number of clones were isolated by the same expression technique used to clone CD2 and 
15 characterized to varying degrees by antibody reactivity, nucleic acid restriction and sequence analysis, and 
immunoprecipitation. Representative clones were transfected into COS cells and analyzed for ability to 
sustain rosette formation. The CD1a, CD1b, CD1c, CD4, CDS, CD6, CD6, CD8, and CD27 (Tp44) clones did 
not form rosettes with human erythrocytes. 

20 

RNA blot analysis 

Equal amounts of total RNA prepared from cell types expressing or lacking CD2 antigen were 
electrophoresed through denaturing agarose gels and transferred to nylon. Hybridization of the transferred 

25 RNA with a strand selective probe (Hu and Messing, Gene , 17:271-277 (1982)) prepared from an M13 clone 
containing a CD2 cDNA insert revealed the presence of prominent 1.65 and 1.3 kb transcripts present in 
RNA derived from thymocyte, activated T cell, and senescent T cell populations. Lesser amounts were 
found in RNA extracted from cDNA donor line, HPB-ALL and less still from MOLT4; barely detectable levels 
were recorded in RNA from the HSB-2 line. No reactivity was observed with RNA from Namalwa (Burkitt 

30 lymphoma), U937 (histiocytic leukemia), HuT-78 (Adult T ceil leukemia), PEER (T cell leukemia), or Jurkat 
clone J3R7 (T cell leukemia) lines. The pattern genomic alteration underlies the failure of the examined T 
cell tumor lines to express CD2 antigen. Restriction analysis of total genomic DNA with a number of other 
enzymes, as well as preliminary results with an incomplete collection of 1 phage recombinants bearing the 
CD2 sequence, shows that the gene is divided into at least four exons. 

35 

EXAMPLE II Isolation and Molecular Cloning of Human LFA-3 Antigen 

The previous example shows that cDNAs encoding surface antigens, such as the CD2 antigen, can be 

40 isolated by the transient expression system of the present invention, in which COS cells transfected with 
cDNA libraries are allowed to attach to ("panned" on) antibody-coated plates. Plasmid DNA is recovered 
from cells adhering to the plates, transformed into E. coli, and the process is repeated, usually twice, to 
isolate the desired clone. Although powerful, this approach cannot be used when the monoclonal antibodies 
used for panning recognize determinants on the untransfected cells. This appears to be the case for anti- 

45 LFA3 monoclonal TS2/9. However, a similar transient expression system based on polyoma virus 
replication-competent cells should allow almost all monoclonals to be used, since the probability of cross 
reaction between murine antibodies and murine cells surface determinants should usually be small. 

A new expression vector, CDM8 (Figure 3) was created from the COS cell vector piH3M described 
previously. The new vector differs by the inclusion of a deleted version of a mutant polyoma virus early 

so region selected for high efficiency expression in both murine and monkey cells, by the replacement of 
substantially all of the human immunodeficiency promoter region with the cognate sequences of the human 
cytomegalovirus immediate early promoter, and by inclusion of a bacteriophage T7 promoter between the 
eukaryotic promoter and the side of cDNA insertion. Expression in COS cells of chloramphenicol acetyltran- 
sf erase by all of the vectors was equivalent. 

55 A library of 1.9 x 10 s recombinants having inserts greater than 0.8kb in size was prepared in the CDM8 
vector from a microgram of poly A+ RNA isolated from the human lymphoblastoid cell line JY. The library 
was introduced into WOP cells (NIH 3T3 cells transfected with polyoma origin deletion DNA) by spheroplast 
fusion, and subjected to three rounds of panning and reintroduction into E. coli as described in Example I. 
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A clone encoding the LFA-3 antigen was identified by indirect immunofluorescence of transfected WOP 
cells, amplified and sequenced (Figure 4). Within the 874 bp insert, an open reading frame of 237 residues 
originates at a methionine codon closely corresponding to the consensus sequence suggested by Kozak, 
Microbiol. Rev. , 47:1-45 (1983). The reading frame terminates in a series of hydrophobic residues lacking 

5 the characteristic basic anchoring residues of internal membrane proteins, but sharing features with known 
phosphatidylinositol-linked superficial membrane proteins. The features include clustered serine or threonine 
residues in a hydrophilic region immediately preceding the hydrophobic domain, and the presence of 
serines and threonines in the hydrophobic portion. 

The amino acid sequence predicted from the nucleotide sequence of the LFA-3 clone was compared to 

io the NBRF database, and no significant homologies were uncovered; the most significant scores were to the 
HIV envelope protein. Within the 200 residues comprising the presumed mature protein are 6 N-linked 
glycosylation sites, and 5 tandem serine or tandem threonine residues that frequently appear in O-iinked 
glycosylated proteins. Ten cysteine residues appear in the complete sequence, 6 of which are distributed in 
the latter half of the mature protein, and one of which fails in the car boxy-terminal hydrophobic domain. 

15 Although esterification of cysteine thiols to fatty acids is a common occurrence in integral membrane 
proteins, and may play an alternate role in membrane anchoring of LFA-3, two examples are known of 
cysteine residues within or' at the margin of the hydrophobic region of phosphatidylinositol linked proteins. 

The predicted sequence suggests that the known manipulations for increasing erythrocyte adhersion to 
T ceils may find direct physical explanation in the structure of the LFA-3 molecule. 

20 Aminoethylisothiouronium bromide, the thiourea adduct of bromoethylamine, undergoes spontaneous rear- 
rangement to mercaptoethylguanidine at alkaline pH. The latter likely gains access to disulfide bonds 
inaccessible to less chaotropic reducing agents and may thereby reduce and promote the unfolding of the 
LFA-3 molecule. Similarly, neuraminidase may decrease steric interference by the many carbohydrate 
chains on the molecule. 

25 RNA and DNA blot hybridization analysis showed that the LFA-3 gene shares no closely related 
sequences in the genome, and encodes a single RNA species of about 1 kb in length. Cell lines that 
express large amounts of surface LFA-3 have greater amounts of LFA-3 RNA than those that express small 
or nondetectable amounts. 

Radioimmunoprecipitation of the antigen expressed in transfected COS and murine cells shows a broad 

30 band of approximately 50kd mean molecular mass, similar to that found in JY cells. 



EXAMPLE ill Isolation and Molecular Cloning of the Human CD28 cDNA Antigen 

35 The previous examples illustrate the monoclonal antibody-based technique of the present invention for 
enrichment of cDNAs encoding surface antigens. In the present example, a method of constructing plasmid 
expression libraries is described which allows the enrichment technique to be fully exploited. The method of 
the present invention for making plasmid expression libraries is of general use for expression cloning. 

The antibody selection technique of the present invention has also been applied to isolate a cDNA 

40 clone encoding the CD28 antigen. The antigen shares substantial homology with members of the im- 
munoglobulin superfamily and forms a dimer structure on the surface of transfected COS cells similar to the 
dimer structure found on T lymphocytes. 



45 pre P aration oj cDNA Libraries. 

Poly(A)+ RNA was prepared from the human T-cell tumor line HPB-ALL by oligo(dT) cellulose 
chromatography of total RNA isolated by the guanidinium thiocyanate method (Chirgwin, J.M. et al., 
Biochemistry , 1 8:5294-5299 (1 979)). cDNA was prepared by a protocol based on the method of Gubler and 

so Hoffman (Gubler, U. et aL, Gene , 25:263-269 (1982)). 4ug of mRNA was heated to approximately 100° C in 
a 1 .5 ml centrifuge tube for 30 seconds, quenched on ice, and the volume adjusted to 70ul with RNAse-free 
water. To this were added 20ul of buffer (0.25 M Tris pH 8.8 (8.2 at 42 *C) 0.25 M KCI, 30 mM MgCfe), 2ul 
of RNAse inhibitor (Boehringer 38 u/ul), 1 ul of 1M DTT, lul of 5ug/ul of oligo dT (Collaborative Research), 
2ul of 25 mM each deoxynucleoside triphosphate (US Biochemicals), and 4ul of reverse transcriptase (Life 

55 Sciences, 24u/ul). After 40 minutes at 42* C, the reaction was terminated by heating to 70 °C for 10 
minutes. To the reaction mix was then added 320ul of RNAse free water, 80ul of buffer (0.1 M Tris pH 7.5, 
25 mM MgCI 2 » 0.5 M KCI, 0.25 mg/ml BSA, and 50 mM (DTT), 25 units of DNA Polymerase I (Boehringer), 
and 4 units of RNAse H (BRL). After 1 hour at 15* C and 1 hour at 22* C, 20 ul of 0.5M EDTA pH 8.0 were 
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added, the reaction mixture was extracted with phenol, NaCI was added to 0.5 M, linear polyacrylamide 
(carrier; Strauss. F. et aL, Cell, 37:889-901 (1984)) was added to 20 ug/ml, and the tube was filled with 
ethanol. After centrifugation for 2-3 minutes at 12,000 x g, the tube was removed, vortexed to dislodge 
precipitate spread on the wall of the tube, and respun for 1 minute. 

5 Unpurifled oligonucleotides having the sequence CTCTAAAG and CTTTAGAGCACA were dissolved at 
a concentration of 1 mg/ml, MgSO* was added to 10 mM, and the DNA precipitated by adding 5 volumes 
of EtOH. The pellet was rinsed with 70% ETOH and resuspended in TE at a concentration of 1 mg/ml. 25 ul 
of the resuspended oligonucleotides were phosphoryiated by the addition of 3 ul of buffer (0.5 M Tris pH 
7.5, 10 mM ATO, 20 mM DTT, mM spermidine, 1 mg/ml BSA, and 10 mM MgCl 2 ) and 20 units of 

10 polynucleotide kinase followed by incubation at 37° C overnight. 

3 ul of the 12-met and 2 ul of the 8-mer phosphoryiated oligonucleotides were added to the cDNA 
prepared as above in a 300 ul reaction mixture containing 6 mM Tris pH 7.5, 6 mM MgCfe, 5 mM NaCI, 
0.35 mg/ml BSA, 7 mM mercaptoethanoi, 0.1 mM ATP, 2 mM DTT, 1 mM spermidine and 400 units T4 
DNA ligase (New England BioLabs) at 15* overnight. 10 ul of 0.5 M EDTA were added, the reaction was 

15 phenol extracted, ethanol precipitated, resuspended in a volume of 100 ul and layered on a 5 ml gradient of 
5-20% potassium acetate in 1 mM EDTA, 1 ug/ml ethidium bromide. The gradient was spun 3 hours at 
50,000 rpm (SW55 rotor) and fractionated manually, collecting three approximately 0.5 ml fractions followed 
by six approximately 0.25 fractions in microcentrifuge tubes by means of a butterfly infusion set inserted 
just above the curve of the tube. Linear polyacrylamide was added to 20 ug/ml, the tubes were filled with. 

20 ethanol, chilled, spun, vortexed and respun as above. The precipitate was washed with 70% ethanol, dried, 
and resuspended in 10 ul. 1 ul of the last 6 fractions was run on a gel to determine which fractions to pool, 
and material less than 1 kb in size was typically discarded. Remaining fractions were pooled and ligated to 
the vector. 

The complete sequence and derivation of the vector is shown in Figure 5. The vector was prepared for 
25 cloning by digestion with BstXI and fractionation on 5-20% potassium acetate gradients as described for the 
cDNA. The appropriate band was collected by syringe under 300 nm UV light and ethanol precipitated as 
above. cDNA and vector were titrated in test ligations. Usually 1-2 ug of purified vector were used for the 
cDNA from 4 ug of poly A+ RNA. The ligation reactions were composed as described for the adaptor 
addition above. The ligation reactions were transformed into MC1061/p3 cells made competent as de- 
30 scribed above. The transformation efficiency for supercoiled vector was 3-5x1 0 8 colonies/ug. 



Recovery and characterization of the CD28 clone. 

35 Panning of the library was carried out as described herein above, using purified antibody 9.3 (DuPont) 
at a concentration of 1 ug/ml in the antibody cocktail. The methods used for COS cell transection, 
radioimmunoprecipitation, RNA and DNA blot hybridization, and DNA sequencing were ail as described 
herein above. 

To isolate the CD27 cDNA, a large plasmic cDNA library was constructed in a high efficiency 

40 expression vector containing an SV40 origin of replication. A preferred version of the vector, containing an 
M1 3 origin, is shown in Figure 6. Three features of the vector make it particularly suitable for this use: (i) 
the eukaryotic transcription unit allows high level expression in COS cells of coding sequences placed 
under its control; (ii) The small size and particular arrangement of sequences in the plasmid permit high 
level replication in COS cells; and (iii) the presence of two identical BSTX I sites in inverted orientation and 

45 separated by a short replaceable fragment allows the use of an efficient oligonucleotide-based strategy to 
promote cDNA insertion in the vector. 

The BstXI cleavage site, CCAN'sNTGG, creates a four base 3' extension which varies from site to site. 
A vector was created in which two identical sites were placed in inverted orientation with respect to each 
other, and separated by a short replaceable segment of DNA. Digestion with BstXI followed by removal of 

so the replaceable segment yielded a vector molecule capable of ligating to fragments having the same ends 
as the replaceable segment, but not to itself. In parallel, cDNA synthetic oligonucleotides were employed 
that give the same termini as the replaceable segment. The cDNA then could not ligate to Itself, but could 
ligate to the vector. In this way, both cDNA and vector were used as efficiently as possible. 

Tailing with terminal transferase achieves the same end, but with less convenience and less overall 

55 efficiently. Moreover, homopolymer tracts located 5 to cDNA inserts have been reported to inhibit 
expression in vitro and invivo (Yokota, T. ( et al., Nuci. Acids Res. 14:151 1-1524 (1986); Riedel, H., EMBO J. 
3:1477-1483 (1985)). Similar approaches based on the use of partially filled restriction sites to favor 
insertion of genomic DNAs (Zabarovsky, E.R., et al., Gene 42:119-123 (1986)) and cDNAs (Yang, Y., et al., 
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Cell 47:3-10 (1986)) recently have been reported. These approaches give 2 or 3 base complementary 
termini, which usually ligate less efficiently than the 4 base extensions reported here. 

Although the cloning scheme of the present invention does not result in a directional insertion of the 
cDNA, the ability to make large libraries easily, coupled with a powerful selection procedure, mades 

5 directional insertion unnecessary. The library construction efficiencies observed according to the present 
invention, between 0.5 and 2x10 s recombinants per ug of mRNA, with less than 1% background and an 
insert size greater than 1 kb, compared favorably with those described for phage vectors lambda gt1 0 (7.5 x 
10 s d/ug of mRNA) and lambda gt11 (1.5 X 10 6 /ug of mRNA) (Huynh, T„ et al.„ In: DNA Cloning Vol I, A 
Practical Approach , Glover, D.M. fed.), IRL Press, Oxford (1985). pp. 49-78); but the resulting clones were 

w more convenient to manipulate. 

Surface antigen CDNAs can be isolated from these libraries using the antibody enrichment method of 
the present invention. In this method, the library is introduced into COS cells (for example, by spheroplast 
or protoplast fusion), where it replicates and expresses its inserts. The cells are harvested by detaching 
without trypsin, treated with monoclonal antibodies specific for the surface antigens desired, and distributed 

75 in dishes coated with affinity purified antibody to mouse immunoglobulins. Cells expressing surface antigen 
adhere, and the remaining cells can be washed away. From the adherent cells, a Hirt fraction is prepared 
(Hirt, B., J. Molec. Biol. 26:365-369 (1967)), and the resulting DNA transformed back into E coli for further 
rounds of fusion and selection. Typically, after two rounds of selection with monoclonal antibodies 
recognizing different surface antigens, a single round of selection is performed with a single antibody, or 

20 pool of antibodies recognizing the same antigen. 



Isolation of a CD28 cDNA. 

25 The CD28 cDNA was isolated from a library of about 3 X 10 s recombinants prepared from cDNA from 
0.8ug of poly A* RNA using an earlier version of the protocol described in the Materials and Methods. The 
library was screened for CD28 (and other surface antigen) cDNA clones by the method outlined above. 
After the third transfection, COS cells were panned with the 9.3 antibody alone. A Hirt supernatant was 
prepared from the adherent cells and transformed into E col[. Plasmid DNA was isolated form eight 

30 colonies and transfected Individually into COS ceil cultures. The presence of the DC28 antigen was 
detected in three of eight transfected cultures by indirect immunofluorescence. All three plasmid DNAs 
contained an insert of about 1,5 kb. 



35 cDNA sequence analysis. 

The CD28 cDNA encodes a long open reading frame of 220 residues having the typical features of an 
integral membrane protein (Figure 17). Removal of a predicted (von Heijne, Nucl. Acids Res. 14:4683-4690 
(1986)) N-terminal signal sequence gives a mature protein of 202 residues comprising an extracellular 

4Q domain with five potential N-linked glycosylation sites (Asn-X-Ser/Thr), a 27-amino acid hydrophobic 
membrane spanning domain, and a 41 -amino acid cytoplasmic domain. Comparison of the amino acid 
sequence of CD28 with the National Biomedical Research Foundation database (Version 10.0) revealed 
substantial homology with mouse and rabbit immunoglobulin heavy-chain variable regions over a domain 
spanning almost the entire extracellular portion of CD28. Within this domain two cysteine residues in the 

45 homology blocks Leu-(Ser or Thr)-Cys and Tyr-{Tyr or Phe)-Cys are shared by CD28, CD4, CD8, 
immunoglobulin heavy- and light-chain variable sequences and related molecules with approximately the 
same spacing (Maddon et aL, Annu. Rev. Biochem. 48:961-997 (1979)). 



50 CD28 cDNA directs the production of a homodimer in transfected COS cells. 

[mmunoprecipitation of CD28 antigen from transfected COS cells was carried out using the monoclonal 
antibody 9.3 (Hansen, J.A., et at, Immunogenetics 10:247-260 (1980)). The material obtained from COS 
cells migrated with a molecular weight of 74kd under nonreducing conditions and 39 kd under reducing 
55 conditions, a pattern consistent with homodimer formation. Under the same conditions activated T cells give 
bands with molecular weights of 87 and 44kd, and HPB-ALL ceils give bands of 92 and 50 kd, under 
nonreducing and reducing conditions respectively. The variation in molecular weight of the material 
obtained from different cell types arises as a result of differing glycosylation patterns characteristic of each 
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type. Similar results were observed with other leukocyte surface antigens (Seed et al, Proc. Natl. Acad. Sci 
USA 87 (1987)). The nucleotide sequence of the CD28 cDNA predicts a mature protein with molecule 
weight of 23 kd, much smaller than observed in these experiments, and probably attributable to utilization of 
the 5 N-linked glycosylation sites predicted by the amino acid sequence. 



RNA blot analysis. 

Equai amounts of total RNA prepared from cell types expressing or lacking CS28 were subjected to 
to RNA blot analysis as described hereinabove. Four bands with molecular weights of 3.7, 3.5, 1 .5, and 1 .3 kb 
were visible in lanes containing RNA thymocytes, T blasts, senescent T cells, and the T cell leukemia cell 
lines PEER and HPB-ALL. No bands were detected in lanes containing RNA prepared from the cell lines 
U937 (histiocytic leukemia), HuT-78 (Adult T cell leukemia), Jurkat (T cell leukemia), Namalwa (Burkitt 
lymphoma), MOLT4, and HSB-2, all of which do not express CD28. The 1.5 kb transcript presumably 
rs corresponds to the isolated cDNA, and the 3.7 and 3.5 kb species reflect incomplete splicing or alternative 
polyadenylation site utilization. The 1.3 kb transcript may terminate at an unconventional polyadenylation 
signal, since there is no obvious candidate in the sequence. 



20 The CD28 gene is not rearranged. 

DNA blot analysis (Seed et al., Proc. Natl. Acad. Sci USA 87 (1987)) of genomic DNA from placenta, 
peripheral blood lymphocytes, T cells, HeLa cells, or the tumor lines used in the RNA blot analysis above 
showed identical Dra 1 digest patterns Indicating that rearrangement is not involved in the normal 
25 expression of the CD28 gene during development. Similarly, no gross genomic rearrangement underlies the 
failure of the examined T-cell tumor lines to express CD28 antigen. It may be inferred from the Dra 1 
fragment pattern that the CD28 gene contains at least two introns. 



30 EXAMPLE IV Isolation and Molecular Cloning of Two Human CD7 Antigen cDNAs 

The C07 cluster of antibodies (Paiker, et aL, Leukocyte Typing II, Springer-verlag, New York, 303-313 

(1985) ) recognized a 40 kd glycoprotein <gp40) on the surface of peripheral blood T cells and thymocytes. 
Early studies with anti-CD7 antibodies showed that CD7* T cells enhance immunoglobulin (ig) synthesis by 

35 B cells (Miroshima et aL, J. Immunol . 129:1091-1098 1982)), suppress B cell Ig synthesis when stimulated 
with Concanavalin A (Haynes et aL Proc. Natl. Acad. Sci. U.S.A. 76:5928-5833 (1979)) and are the 
precursors of the cytotoxic T cells generated in mixed lymphocytic culture (Morishima et aL, J. Immunol . 
129:1091-1098 (1982)). Furthermore, CD7 has been found to be the most reliable marker for the identifica- 
tion of T cell acute lymphoblastic leukemia (Link et aL, Blood 62:722-728 (1983)). As such, studies have 

40 been carried out, in which cytotoxins coupled to the anti-CD7 antibody 3A1 were used to purge bone 
marrow prior to reinfusion to avoid early relapse in autologous bone marrow transplants or as prophylaxis 
against graft vs. host disease in allogenic bone marrow transplants (Ramakrishnan et aL, £ Immuol . 
135:3616-3622 (1985)). Similarly, anti-CD7 antibodies also show promise as immunosuppressive agents in 
the treatment of allograft rejections (Raftery et aL, Transpl. Proc , 17:2737-2739 (1985)) which is in accord 

45 with the recent observation that the anti-CD7 antibody 7G5 significantly inhibits the primary mixed 
lymphocyte reaction (Lazarovits et aL Leukocyte Typing IIL Oxford Univ. Press, Oxford (1987)). 

At present the physiological role of CD7 is not understood. It is known that anti-CD7 antibodies are not 
mitogenic, and do not block the T cells' response to PHA, or tetanus toxoid (Palker et aL Leukocyte Typing 
. Springer-verlag, New York, 303-313 (1985)). Some have noted that expression of CD7 in thymocytes 

so occurs prior to the onset of T cell receptor beta-chain rearrangement (Pittaluga et aL, Blood , 68:134-139 

(1986) ) and have pointed to a possible role for CD7 in this rearrangement and subsequent expression of the 
T cell receptor. It is clear that the cloning of the CD7 antigen would further efforts to understand its role in T 
cell physiology. Nucleotide sequencing and preliminary characterization of two cDNAs encoding the CD7 
antigen was carried out according to the method of the present invention. Prompted by the recent 

55 suggestion that CD7 may be, or be part of, the T cell IgM receptor (Sandrin et aL, Leukocyte Typing IN. 
Oxford Univ. Press, Oxford (1987)), the ability of COS cells expressing CD7 to bind IgM or IgM immune 
complexes was evaluated. The results do not support the simple notion that CD7 itself is an IgM receptor. 
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Preparation of cDNA library and recovery and characterization of CD7 clones 

Preparation of an HPB-ALL cDNA library in the expression vector piH3 was carried out as described 
herein. Panning of the library was carried out according to the method of the present invention, using 
5 purified anti-CD7 antibody Leu9 (Becton Dickinson) and antibody 7G5 as ascites fluid was diluted 1/1000. 
Methods for cell transfection, radioimmunoprecipitation, DNA and RNA blot hybridization and DNA sequen- 
cing were all as described herein. 



70 IgM and IgG binding by COS cells transferred with CD7 and CDw32 

Human Igm, IgG, and IgA antibodies, affinity purified FITC conjugated goat anti-human immunoglobulins 
antibodies (anti-!g(G + M + A)), washed and preserved bovine red blood cells, and IgG and IgM fractions of 
rabbit anti-bovine red blood cell antibodies were purchased from Cooper Biomedical (Maiverne, PA). COS 

15 cells were transfected by the DEAE Dextran method with cDNAs encoding the CD7, CDw32, and CD28 
surface antigens. 48 hours after transfection the cells were washed with PBS/0.5% BSA and incubated with 
either human IgM, IgG or IgA antibodies at a concentration of 1ug/ml, at 4° C for 2 hours. Subsequently the 
cells were washed with PBX/0.5% BSA and incubated for 30 minutes at 4°C with FITC conjugated rabbit 
anti-human immunoglobulins. After washing the cells were examined with a fluorescence microscope. The 

20 experiments were also performed in the presence of 0.1% azide with the same results. 

Bovine erythrocytes for rosette assays were prepared as described by Ercolani et al., J. Immunol . 
127:2044-2051 (1981). Briefly, a 2% suspension of bovine erythrocytes was washed with PBS/0.5% BSA 
and treated with subagglutinating amounts of either IgG or the IgM fraction of rabbit anti-bovine erythrocyte 
antibodies at 4*C for 1 hour. Erythrocytes were then washed twice with PBS/0.5% BSA and adjusted to a 

as 2% solution. 2 ml of antibody-coated erythrocytes were layered on 60mm dishes containing COS cells 
which has been transfected 48 hours earlier with either CD7, CD32 or CD28 by the DEAE Dextran method. 
The dishes were then centrifuged at 150 X g at 4° C for 15 minutes. After an additional 45 minute 
incubation at 4*C, the plates were gently washed 5 times with the 5 mis of PBS/0.5% BSA, and the COS 
cells were examined for rosette formation. These experiments were also performed in the presence of 0.1% 

30 sodium azide without alteration of the results. 



Formation of T ceil rosettes with antibody-coated erythrocytes 

35 Peripheral biooc lymphocytes were obtained from heparinized blood by centrifugation at 4*C over a 
Ficol-Hypaque gradient at 400 x g for 30 minutes. Leukocytes at the interface were washed two times with 
PBS. The leukocytes were adjusted to 10Y7 cells/ml in IMDM/10% Fetal Bovine Serum (FBS) and 
incubated in tissue culture dishes at 37' C for 30 minutes. Nonadherent cells were transferred to new 
dishes, and PHA was added to stimulate proliferation of T lymphocytes. On the next day the cells were 

40 washed with PBS and placed in fresh IMDM/10%FBS. 

Rosette assays were performed three days later. Cells were washed with PB/0.5% BSA, and a 10 ul 
suspension of 2% lg-coated erythrocytes prepared as described above was added to 10 ul of PBS/0.5% 
BSA containing 5 X 10 6 cells/mi. The mixtures were placed in Falcon round bottom 96 well plates and 
centrifuged at 150 X g for 15 min at 4°C. After an additional incubation of 45 min at 4*C pellets were 

45 resuspended with 10 ul of PBS/0.5% BSA, and the rosettes scored by phase contrast microscopy. The 
experiments were carried out in both the presence and absence of 0.1 % sodium azide with no detectable 
difference. 



so Isolation of cDNAs encoding the human CD7 antigen. 

To isolate CD7 cDNAs, a large plasmid library was constructed in the expression vector H3M as 
describe hereinabove. The library was introduced into COS cells by spheropiast fusion, and allowed to 
replicate and express its inserts. The COS cells were harvested by detaching without trypsin 48 to 72 hours 
ss after transfection, treated with monoclonal antibodies specific for surface antigens believed to be encoded in 
the library, and distributed in dishes coated with affinity purified anti-mouse antibody as described herein. 
Under these conditions, cells expressing surface antigen adhere and the remaining cells can be washed 
away. 
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A Hirt (Hirt, J. MoL Biol., 26:365-369 (1967)) fraction was prepared from adherent cells, and the 
resulting DNA transformed back into E. coli for further rounds of fusion and selection. In the third round of 
selection the detached cells were treated with a mixture of monoclonal antibodies specific for CD7 (765 and 
Leu9), and a Hirt supernatant was again generated and transformed into E. coK. After transformation of the 

s DNA into E. coli 8 colonies were picked, and the plasmid DNA prepared from them by an alkaline miniprep 
procedureTMa-niatis, et aL, Molelcular Cloning: A Laboratory Manual , Cold Spring Harbor Press, Cold Spring 
Harbor, New York (1982)). DNA was preparecTfrom 8 resulting colonies and transfected into COS cells. 
After 3 days, surface expression of the CD7 antigen was detected by indirect immunofluorescence in 7 of 8 
transfected dishes. Restriction enzyme digest of the corresponding plasmid DNAs revealed two species. 

w One contained a 1 .2 kb insert, and the other a 1 .3 kb insert. 



CD7 cDNA sequence analysis 

75 Both isolates were sequenced by the dideoxynucieotide method. The 1 .2 kb cDNA encodes a long 
open reading frame of 240 residues having the typical features of an integral membrane protein. The initial 
assignment of the signal sequence cleavage site by the method of von Heijne (Nuci. Acids Res. 14:4683- 
4690 (1986)) was at the 18th residue, it later was determined, however, that the homology with im- 
munoglobulin variable regions would better predict the mature terminus at residue 26; this assignment 

so would also correlate well with the position of the intron as discussed below and as shown in Figure 8. 
Removal of the predicted N-terminal signal sequence gives a mature protein of 215 residues with a 
predicted molecular mass of 23kd. In the extracellular domain are two N-linked glycosylation sites (Asn-X- 
Ser Thr), in agreement with the results of Sutherland et aL (J. Immunol. 133:327-333 (1984)), who also 
showed the presence of O-linked glycans and covalently associated palmitic acid on the mature protein. In 

25 the 27 amino acid hydrophobic membrane spanning domain is a single cysteine residue which may be the 
site of fatty acylation (Rose et aL, Proc. Natl. Acad. Sci. USA 81_:2050-2054 (1984); Kaufman et aL, J. Biol. 
Chem. 259 :7230-7238 (1984)). The length of the cytoplasmic domain, 39 residues.is in good agreement with 
the 30-40 amino acids predicted by protease digestion of the CD7 precursor in rough microsomal 
membrane fractions (Sutherland et aL, J. Immunol. 133:327-333 (1984)). 

30 Sequence analysis of the 1.7 kb clone (Figure 8) revealed the presence of an intron located 121 bp 
from the 5 end. The 411 bp intron contains stop codons in all three reading frames and is located just 
downstream of the secretory signal sequence, as is frequently observed for secreted or surface proteins. 
Both the 5' and 3' ends of the intron conform to the splice donor/acceptor consensus AAG 
GTRAGA/.../Ys-i 1 NYAG A (Mount, NucL Acids Res. 10:459-472 (1982)). Because both the 1.2 and 1.7 kb 

35 clones express CD7 antigen equally well in COS cells, the intron must be excised in COS cells fairly 
efficiently. 

Comparison of the amino acid sequence with the National Biomedical Research Foundation database 
revealed substantial homology with human and mouse immunoglobulin kappa chain and T-cell receptor 
gamma chain variable regions over almost the entire extracellular portion of the molecule. Two cysteine 

40 residues shared in approximately equal spacing by all three structures fall in the conserved sequences He- 
Thr-Cys and Tyr-X-Cys. In kappa chain variable regions these cysteins form a disulfide bridge. The 
presence of at least one intrastrand disulfide bond in the CD7 structure has previously been proposed by 
Sutherland et al. (J. Immunol. 133:327-333 (1984)), who noted that immu no precipitation of CD7 gave rise to 
a band witharTapparent molecular mass of 40 kd under reducing conditions and 38 kd under nonreducing 

45 conditions. 

Based on the homology with immunoglobulin V-regions, it is predicted that CD7 contains a disulfide 
bond linking Cys 23 and Cys 89. A second disulfide bond, linking Cys 10 and Cys 117, has been proposed, 
based on the structural similarity between CD7 and Thy-1. The extracellular domains of both Thy-1 and 
CD7 have 4 cysteine residues, in roughly homologous positions. The 4 cysteine residues, in roughly 
so homologous positions. The 4 cysteine residues of Thy-1 are joined in two internal disulfide bridges between 
Cys 9-111 and Cys 19-85 (Williams et aL, Science 216:696-703 (1982)). In Thy-1, Cys 111 forms an amide 
bond with the ethanolamine moiety of a substituted phosphatidylinositol, and is thus the last residue of the 
mature moleculs (Tse et aL, Science 230:1003-1008 (985)). !n CD7, Cys 117 is followed by four repeats of a 
sequence whose consensus is Xaa-Pro-Pro-Xaa-Ala-Ser-Ala-Leu-Pro, and which, it is proposed, plays the 
55 role of a stalk projecting the V-like domain away from the surface of the cell. 

In addition to the homologies shown in Figure 20 and mentioned above, the extracellular domain of CD7 
has significant homology with both chains of the rat CDS heterodimer (Johnson et aL, Nature 323:74-76 
(1986)), and the myelin P 0 protein (Lemke et aL, Cell 40:501-508 (1985)). 
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CD7 directs the production of a 40 kd protein in transfected COS cells 

Immunoprecipitation of CD7 antigen from transfected COS cells was carried out as described herein 
using monoclonal antibody 7G5 (Lazarovits et aL, Leukocyte Typing IH, Oxford Univ. Press, publisher, 

5 Oxford, England (1987). The material obtained from COS cells migrated with as a broad band with 
molecular weight of 40 kd under reducing conditions. Under the same conditions HPB-AII cells (the cDNA 
donor line) and activated T cells gave bands with molecular widths of 41 and 39 kd respectively. In both the 
COS cell and HPB-ALL lane a faint band with molecular weight of 30 kd was also observed, possibly 
corresponding to a partially glycosylated precursor (Sutherland, D.R., et aL, J. Immunol, 133:327-333 

io (1984)). 



RNA blot analysis 

75 Equal amounts of total RNA prepared from cell types expressing or lacking CD7 were subjected to 
Northern blot analysis as described herein. A single 1 .3 kb species was visible in lanes containing RNA 
from thymocytes, activated T cells, resting T ceils, and the T cell leukemia lines HuT-78, HPB-ALL, Jurkat 
J3R7, HSB-2 and PEER. With the exception of the PEER cell line, none of the T cell tumors showed 
significant overexpression of CD7 transcripts. CD7 RNA was detected in all of the thymus-derived cells, but 

20 not in RNA from U937 (histiocytic leukemia) and Namalwa (Burkitt Lymphoma) cells. No band correspond- 
ing to the 1 .7 kb cDNA could be detected, suggesting that this species is artificially enriched during the 
cloning or library amplification process. 

Enrichment during amplification seems unlikely because the 1 2 kb cDNA clone propagates as well in E. 
coli as the 1 .7 kb clone. However, immediately upstream and downstream from the site of insertion of the 

25 intron are sequences that could form an interrupted stem and loop structure. Eight of the 10 basepairs of 
the potential stem at GC pairs, perhaps giving the structure sufficient stability to interfere with elongation of 
the cDNA first strand. The presence of the intron greatly separates the two halves of the stem, potentially 
eliminating the structure via unfavorable loop entropy and allowing efficient first strand synthesis. 

30 

The CD7 gene is not rearranged 

Southern blot analysis of genomic DNA from placenta, peripheral blood lymphocytes, T cells, HeLa 
cells, or the tumor lines used in the RNA blot analysis above showed identical Dra 1 digest patterns. Thus, 
35 the CD7 gene is not grossly altered during development, and the high level of expression in the PEER cell 
line is not the consequence of a substantial genomic rearrangement. 



COS cells expressing CD7 do not bind IgM 

40 

Human peripheral blooc T lymphocytes express receptors for IgM antibodies (FcRu: Moretta et al., Eur. 
J. Immunol. 5:565-569 (1975); McConnell et aL, Immunol. 30:835-837 (1976)). Recently it has been reported 
that CD7 might play a role in IgM binding by T cells (Sandrin et aL, Leukocyte Typing 111, Oxford Univ. 
Press, publisher, Oxford, England (1987)). L cells, normally CD7~ and FcRu - , become CD7* and FcRu + 

45 when transfected with a 16 kb genomic fragment encoding the CD7 antigen (Sandrin et aL, Leukocyte 
Typing III, Oxford, Univ. Press, publisher, Oxford, England (1987)). Furthermore, IgM binding to CD7- 
positive cells can be blocked by the anti-CD7 monoclonal antibody Huiy-m2 (Thurlow et aJ, Transplantation 
38:143-147 (1984)), and IgM columns bind a 37 kd protein from radiolabeled lysates of peripheral blood T 
lymphocytes (Sandrin et aL, Leukocyte Typing IH, Oxford Univ. Press, publisher, Oxford, England (1987)). 

so Accordingly, COS cells expressing CD7 were tested for their ability to bind IgM. IgM receptor activity 
was assayed either by direct binding (Hardin et aL, Proc. Natl. Acad. Sci. USA 76:912-914 (1979)) or by a 
rosette assay with ox erythrocytes coated with an IgM fraction of rabbit anti-bovine red cell serum as 
described by Ercolanl et aL J. Immunol. 127 :2044-2051 (1981)). Cells expressing CD7 neither bound human 
IgM nor formed rosettes with Igm-coated erythrocytes. Under the same conditions, COS cells transfected 

55 with a cDNA encoding the human IgG receptor CDw32 bound IgG directly and formed rosettes with IgG- 
coated erythrocytes. Erythrocytes coated with IgM or IgG antibodies also adhered to a fraction of peripheral 
blood lymphocytes as reported (Moretta et aL, Eur ^ Immunol. 5:565-569 (1 975)). 

These results do not support the notion that the CD7 antigen is by itself an IgM receptor, although they 
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do not exclude the possibility that COS cells suppress IgM binding activity in some manner, or that CD7 is 
part of, or modified to become, an IgM receptor. That CD7 is not by itself an IgM receptor is supported by 
the observation that a number of CD7* T ceil lines are FcRu-(Sandrin et aL, Leukocyte Typing in, Oxford, 
Univ. Press, publisher, Oxford, England (1987)). 

5 

EXAMPLE V Isolation and Molecular Cloning of the Human CDw32 Antigen 

A cDNA encoding the human CDw32 antigen, a human receptor for immunoglobulin Q constant 

w domains (Fc receptor), was isolated by the method of the present invention, by virtue of its affinity for its 
ligand, IgG. The sequence of the isolated clone is most closely related to the murine beta 2 Fc receptor, 
but has diverged completely in the portion encoding the cytoplasmic domain. The receptor expressed in 
COS cells shows a preference for IgGj among IgG subtypes, and no affinity for IgM, IgA or IgE. 

To isolate the Fc receptor clone, cDNA libraries were prepared from tumor cell lines or from a human 

75 tumor and transfected into COS cells. After 48 hours, the cells were treated with mouse or human IgG 
antibodies, and allowed to settle on dishes coated with affinity-purified sheep anti-mouse IgG or goat anti- 
human IgG antibodies. After lysis, DNA recovery, and transformation in B coli , the cycle was repeated for 
two more rounds. Although no positive clones were isolated from the tumor line libraries, a cDNA clone 
encoding an Fc receptor was isolated from a library prepared from a human adrenal tumor. It has been 

20 discovered that many tumors are heavily infiltrated by macrophages and lymphocytes. Thus, tumor RNA 
may be a productive source in general for transcripts of human macrophages. 

By indirect immunofluorescence assay, the human receptor expressed on COS cells bound all mouse 
and human IgGs with relatively low affinity -10 _7 M), and a clear discrimination was noted among human 
antibodies for IgGi. Human IgM, IgAi, and lgA 2 , and IgE did not bind, nor did murine IgM or IgA. As 

25 expected, human Fc, but not Fab fragments, bound to the transfected cells. Among monoclonal antibodies 
donated to the Third International Workshop on Leukocyte Differentiation Antigens, three gave strong 
positive immunofluorescence: two (out of two) recognizing the Fc Receptor CDw32 determinant, and one 
(out of four) recognizing the CD23 (B cell IgE Fc receptor) determinant. Monoclonals recognizing the T 
cell/Macrophage Fc receptor antigen CD16 gave only weak immunofluorescence comparable to that shown 

30 by control ascites. 

Radioimmunoprecipitation of transfected COS cells with CDw32 antibodies showed the presence of a 
single 40kd species, comparable in size to the antigen recognized on the surface of the myeloid CDw32 
line HL-60, and to the less abundant antigen present on the histiocytic leukemia line U937. This result 
reinforces the notion that the isolated receptor is CDw32, as the CD16 receptor is reported to be 

as substantially larger (60-70kd). 

The nucleotide sequence of the isolated receptor (Figure 9) is highly homologous to that of members of 
the recently isolated murine receptor family, and most closely related to the murine beta 2 receptor by 
nucleic acid homology. Surprisingly, the murine beta 2 receptor is found on T and B lymphocytes and 
macrophages, while the alpha receptor is restricted to macrophages; in the human system, C0w32 (shown 

40 here to be beta 2 -like) is restricted to macrophages while another Fc receptor (CD16) is found on 
lymphocytes and macrophages. The human sequence appears to have diverged from the mouse sequence 
by insertion of approximately 1 kb of DNA a few bases 3 to the junction between the transmembrane and 
cytoplasmic domains. The junctions of the insertion site do not show obvious relationships to splice donor 
and acceptor sequences. Comparison of the human and murine peptide sequences showed that the peptide 

45 sequence diverges at the end of the transmembrane domain, before the nucleotide sequence diverges, 
suggesting the existence of a selective pressure favoring the creation of a different cytoplasmic domain. 

RNA blot analysis showed that myeloid but not lymphocytic ceil lines expressed RNA homologous to 
the CDw32 probe. DNA blot analysis showed multiple bands consistent with the existence of a small 
multigene family. 

50 

EXAMPLE VI Isolation and Molecular Cloning of Two cDNA Clones Encoding the B Lymphocyte-specific 
CD20 (B1, Bp35) Antigen 

55 Recent studies suggest that the pan B cell antigen CD20 (B1, Bp35) plays an important role in B cell 
activation. Monoclonal antibodies (mAb) to CD20 induce different cellular responses depending on the 
antibody used and the stage of differentiation or activation of the target B cells. The monoclonal antibody 
1F5 activates resting B cells by initiating the transition from the Go to the Gi phase of the cell cycle, and 
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induces dense tonsillar B ceils to proliferate (Clark et aL, Proc. Natl. Acad. Sci USA 82:1766 (1985); Clark 
and Shu, J. Immunol. 138:720 (1987)). However, 1F5 does not induce an increase in cytoplasmic free 
calcium and does not induce circulating B cells to proliferate (Rabinovitch et aL, In: Leukocyte Typing 111 - 
(McMichael, Ed.)» P- 435, Oxford University Press (1987))/ Other anti-Cd20 mAbs, such as B1, have been 

5 shown to block B cell activation (Tedder et aL, J. Immunoi. 135:973 (1985)) and both 1 F5 and B1 can inhibit 
B cell differentiation (Golay et aL, J. Immunol 135:3795 (1985)). Recently it has been suggested that 
phosphorylation and internalization of CD20 may be necessary steps for B cell entry into the Gi phase of 
the cell cycle (Valentine et aL, In: Leukocyte Typing 111 (McMichael, Ed.), p. 440, Oxford University Press 
(1987)). In the present example, two CD20 cDNA clones were isolated and expressed using the methods of 

io the present invention. 



Preparation of cDNA Library and Recovery of cDNA Clones by Panning 

15 Poly (A)* RNA was prepared from the human Burkitt ceil line Daudi by oligo (dT) cellulose chromatog- 
raphy of total RNA isolated by procedures described herein. cDNA preparation and expression library 
construction were carried out as described. 

Anti CD20 mAbs 1F5, 2H7, B1, L27, G28-2, 93-1 B3, B-C1, and NU-B2 were obtained from the 
International Leukocyte Typing Workshop (Valentine et aL, in: Leukocyte Typing III (McMichael, Ed.), p. 440, 

20 Oxford University Press (1987)). Purified mAbs were used at a concentration of 1ug/ml and ascites were 
used at a dilution of 1:1000. Panning was done according to the present method. In the first round of 
screening, eight 10cm dishes of 50% confluent COS cells were transfected by the DEAE-Dextran method. 
Subsequent screening cycles were performed by spheroplast fusion. 

25 

Immunoprecipitation, Sequencing, RNA and DNA Blot Hybridization 

B cell lines CESS and Daudi were metabolically labeled with 35 S-methionine and 3S S-cysteine for 6h at 
37* C. COS cells transfected by the DEAE-Dextran method were similarly labeled 36 hours post-transfec- 
30 tion. The labeled cells were incubated with the B1 mAb (Coulter) at 4* C for 1 h, washed in PBS, and lysed 
with 0.5% NP-40, 0.1% SDS, 0.5% deoxycholate and 1mM PMSF in PBS. After centrifuging (13000xg, 5 
min.), the lysate was incubated with fixed S. aureus cells (Caibiochem) for 1 hr at 4* C. The S.aureus cells 
were pelleted, washed 5 times with 1% NP-40/PBS, eluted and electrophoresed through 12.5% 
polyacrylamide gels. 

35 DNA and RNA blot analysis and hybridization probe preparation were carried out as described. 
Sequencing was done by the method of Sanger et aL Proc. Nat. Acad. Sci. USA 74:5463 (1977). The 
nucleotide sequence of the CD20.4 cDNA is represented in Figure 10. 

Two cDNA clones, bearing inserts of 1 .5 (CD20.4) and 1 .0 kb (CD20.6), were isolated from a Daudi cell 
DNA library by panning with a panel of mAbs against CD20. COS cells transfected with either clone reacted 

40 with all members of the panel of antibodies. Immunoprecipitation of the cDNA-encloded protein from 
transfected COS cells showed two bands of 32 and 30kd reminiscent of the 37 and 35kd bands observed in 
different B cell subsets and lines (Valentine et aL, "Structure and Function of the B Cell Specific 35-37kDa 
CD20 Protein," In: Leukocyte Typing IH, A. McMichael et aL, Oxford University Press, p. 440 (1987)). It has 
been the experience of the present inventors that the molecular masses of surface antigens expressed in 

45 COS cells are consistently smaller than those of their native counterparts. This may be due to differences in 
giycosylation. 

Both cDNA clones have the same coding sequence, and differ only in the 3' untranslated region. The 
insert in clone CD20.6 has a short polyA tail and lacks a consensus polyadenylation signal, while the insert 
in CD20.4 lacks a polyA tail and extends 431 bp beyond the 3' terminus in CD20.6 (Fig. 10A). 
so RNA blot analysis showed that three transcripts of 3.8, 3.0 and 1.5 kb were present in B cells but 
absent from other ceil types, in agreement with the known pattern of antibody reactivity (Clark et aL, Proc. 
Natl. Acad. ScL USA 82:1766 (1985); Clark et ai, JL Immunol. 138:720 (1987); Tedder et aL, J. Immunol. 
135:973 (1985); Golay et aL , J. Immunol. 135:3795 (1985)). It appears likely that the Cd20.6 clone is derived 
from the 1.5 kb transcript or possibly from an even shorter, undetectable species. Because the CD20.4 
55 clone lacks a poly (A) + tail, its source cannot be inferred at present. 

DNA blot analysis showed that the CD20 genomic sequences are not rearranged during development 
and are not amplified in the cell lines examined. A restriction fragment length polymorphism was observed 
in a DNA sample obtained from placenta. 
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The amino acid sequence predicted by the cDNA contains 297 residues and has a molecular mass of 
33,097 daltons. The sequence contains three major hydrophobic stretches involving residues 51-103, 117- 
141 and 183-203 (Fig.10). Two other notable characteristics are the absence of an ami no-terminal signal 
peptide and the presence of a highly charged carboxy-terminal domain. A polyclonal anti-CD20 antibody 

s that recognized the last 1 8 residues of the carboxy-terminus reacts with lysates of cells expressing CD20 
but not with intact cells, suggesting that the CD20 carboxy terminus is located within the cytoplasm. Since 
there is no amino-termina! signal peptide, it is likely that the amino-terminus is also intracellular, and that 
the first hydrophobic region acts as an internal membrane insertion signal (Zerial et aL, EM BO £ 5:1543 
(1 986)). The first hydrophobic region is composed of 53 residues and is therefore long enough to span the 

iq membrane twice if organized as an alpha helix. Because there are two remaining hydrophobic regions, the 
intracellular side. Alternatively, the carboxy-terminus requires that the first hydrophobic domain exit the 
membrane on the intracellular side. Alternatively, the carboxy-terminal antibody may only recognize 
epitopes exposed by detergent treatment allowing the carboxy-terminus to be extracellular and forcing the 
first hydrophobic domain to exit the membrane on the extracellular side. The sequence contains 2 potential 

15 N-glycosylation sites (Asn-Xaa-Ser/Thr, where Xaa cannot be Pro (Bause, Biochem. J. 209:331 (1983)) at 
positions 9 and 293, but neither of these is expected to be used if located in intracellular domains of the 
molecule. The difference in molecular mass between CD20 expressed on COS cells and on B cells is 
therefore presumably due to 0-linked glycosylation, although other forms of post-translation al modification 
are not excluded. If the carboxy-terminus is intracellular, the only extracellular domain would lie between 

20 residues 142 and 182. This region is rich in serine and threonine residues which might support 0- 
giycosylation. 

The observation of two protein species in COS cells cannot be explained by alternate splice formation 
because the cDNA sequence does not contain any promising splice donor or acceptor sequences (Shapiro 
et al., Nucl. Acids Res. 15:7155 (1987)). A difference in glycosylation or alternate translational initiation site 
25 selection may account for the two species observed. Initiation at either the first or the second ATG gives 
protein molecular masses of 33.1 and 30.8 kd respectively, in good agreement with the sizes observed in 
COS cells. Neither ATG is embedded in the consensus sequence proposed by Kozak ( Nucl. Acids Res. 
12:857 (1984)). Use of alternate initiation sites has been reported for several proteins (Kozak, Nucl. Acids 
Res. 12:857 (1984)). 

30 Comparison of the peptide sequence with the sequences in the National Biomedical Research 
Foundation database showed no significant homology by the FASTP rapid sequence alignment algorithm. 
Because the bulk of the protein appears to be confined to the interior of the membrane and the ceil, it 
seems plausible that it may play a role in transducing signals from other transmembrane proteins to the cell 
interior. Consistent with this role is the relatively hydrophilic nature of the hydrophobic regions which might 

as allow hydrogen bond interactions with the transmembrane portions of other proteins. 



EXAMPLE VII Isolation and Molecular Cloning of iCAM. An Adhesion Ligand of LFA-1 

40 Antigen-specific ceil contacts in the immune system are strengthened by antigen-non-specific inter- 
actions mediated in part by lymphocyte function associated or LFA antigens (Springer, T.A., et aL, Annu. 
Rev. Immunol. 5:223-252 (1987); Anderson, D.C., et aL, Annu. Rev. Medicine 5:175-194 (1987)). The LFA-1 
antigen, a major receptor of T ceils, B cells and granulocytes (Rothlein, R., et al., Exp. Med. 163:1132-1 149 
(1987)), is involved in cytolytic conjugate formation, antibody-dependent killing by NK cells and 

45 granulocytes, and helper T cell interactions. LFA-1 has been placed in the integrin family of cell surface 
receptors by virtue of the high sequence similarity between the LFA-1 and integrin beta chains (Kishimoto, 
T.K., et al., Cell 48:681-690 (1987); Hynes, R.O. Cell 48:549-554 (1987)). The adhesion ligands of the 
integrin family are glycoproteins bearing the Arg-Gly-Asp (RGD) sequence motif, e.g., fibronectin, 
fibrinogen, vitronectin and von Willebrand factor (Ruoslahti, E., et aL, Cell 44:517-518 (1987)). 

so In this example, the Intercellular Adhesion Molecule-1 (ICAM-1), a ligand for LFA-1 (Rothlein, R. } et al., 
J. Immunol. 137:1270-1275 (1986); Dustin, M.L, et al. J. Immunol. 137:245-254 (1986)), was cloned 
according to "the methods of the present invention. ICAM contains no RGD motifs, and instead is 
homologous to the neural cell adhesion molecule NCAM (Cunningham, B.A., et aL Science 236:799-806 
(1987); Bartheis, D. p et aL, EMBO J. 6:907-914 (1987)). COS cells transfected with the ICAM cONA clone 

55 bind myeloid cells by a specific interaction which can be blocked by monoclonal antibodies directed against 
either LFA-1 or ICAM-1. 

A cDNA library was constructed using RNA prepared from HL60 cells induced with phorbol myristyi 
acetate (PMA). The library was transfected into COS cells and cells expressing surface antigens were 
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recovered according to the methods of the present invention by panning with the anti-ICAM monoclonal 
antibodies (mAbs) 8F5 and 84H10 (McMichael, A.J., et al., eds., Leukocyte Typing IIL White Cell 
Differentiation Antigens , Oxford University Press (1987)). Episomal DNA was recovered from the panned 
ceils and the expression-panning cycle repeated a further 2 times to obtain a cDNA clone designated 
5 plCAM-1 . 

COS cells transfected with plCAM-1 gave positive surface immunofluorescence reactions with three 
anti-ICAM-1 antibodies: 8F5; 84H10; and RR-1. Immunoprecipitation of plCAM-1 -transfected COS cells with 
the mAb 84H10 gave a band of molecular mass lOOkD. 30). A slightly larger protein of 110 kD was 
precipitated from HL60 cells induced for 48 hours with either phorbol myristyl acetate (PMA), gamma- 

io interferon (gammalFN), tumor necrosis factor (TNF), or interleukin-1 beta (IL-1 beta), but was absent from 
uninduced cells. The smaller molecular mass of ICAM-1 expressed in COS cells is consistent with the lower 
molecular masses observed for other surface antigens expressed in COS cells. 

RNA blot analysis showed 2 species of 3.2 kb and 1.9 kb present in HL60 cells stimulated with either 
PMA, gamma I FN, TNF or IL-1 gamma, but absent in uninduced cells. Thus, the expression of ICAM-1 is 

15 regulated by a number of cytokines, apparently at the level of transcription. Similar species were present in 
B cells (JY and Raji), T Cells (Peer and T blasts) and Lymphokine Activated Killer Cells 'l_AK). The structure 
of these ICAM-1 transcripts and their relationship to the plCAM-1 cDNA remains to be established. Blot 
hybridization of genomic DNA from placenta revealed a pattern consistent with a single copy gene. 

To investigate whether plCAM-1 encodes a functional cell adhesion molecule, COS cells expressincj 

20 ICAM-1 were tested for their ability to bind HL60 cells. After 30 minutes at 37° C in the presence of Mg 2 *, 
HL60 cells strongly adhered to the ICAM-expressing COS cells, but not to mock transfected cells. The 
specificity of this adhesion was demonstrated by preincubating the ICAM-1 expressing COS cells with mAb 
84H10. All HL60 binding was abolished under these conditions. An isotype matched monoclonal antibody, 
W6/32, which recognizes a monomorphic H LA- ABC related determinant of approximately equal abundance 

25 to ICAM-1 on transfected COS cells, had no effect on the adhesion. Similarly, preincubation of the HL60 
cells with either 84H10 or W6/32 did not inhibit binding. 

To determine if LFA-1 was acting as the receptor for ICAM-1 in this system, HL60 cells were pretreated 
with antibodies against the beta chain of LFA-1 (CD18 (McMichael, A.J., et al., eds M Leukocyte Typing IIL 
White Ceil Differentiation Antigens , Oxford University Press (1987))) and then subjected to the binding 

30 assay. All adhesion to ICAM-expressing COS cells was blocked. Pretreatment of COS cells with the CD18 
antibodies had no effect on the adhesion. This provides direct evidence that ICAM-1 is indeed acting as an 
adhesion ligand for LFA-1 . 

The sequence of the plCAM-1 cDNA insert consists of 1846 nucleotides (Fig. 11). The predicted 
peptide sequence of 532 residues has the typical features of a transmembrane protein including a putative 

35 signal sequence, which may be cleaved between glycine-25 and asparagine-26 (von Heijne, G„ Nucl. Acids 
Res. 14:4683-4690 (1986)), and a single 25 residue membrane-spanning domain terminating in a short, 
highly charged cytoplasmic domain. The extracellular domain contains seven potential N-linked glycosyla- 
te sites which could adequately explain the difference in size between the deglycosylated precursor (55 
kD) and the final product (90-115 kD) (Dustin, M.L., et aL, J. Immunol. 137:245-254 (1986)). Differential use 

40 of these putative glycosyiation sites could also explain the heterogeneous molecular mass of ICAm-1 
observed in different cell types (Dustin, M.L., et aL, J. Immunol. 137:245-254 (1986)). 

LFA-1 is a member of the integrin family of cell surface receptors (Kishimoto, T.K., et aL, Cell 48:681- 
690 (1987); Hynes, R.O., Cell 48:549-554 (1987)). The tripeptide motif Arg-Gly-Asp (RGD) is a common 
feature of the ligands for this family, e.g., fibronectin, fibrinogen, vitronectin and von Willebrand factor, and 

45 is crucial for ligand-receptor interaction (Ruosiahti, E, et aL, Cell 44:517-518 (1987)). However, ICAM-1 
contains no RGD motifs, bearing instead a single RGE sequence at position 152. A search of the National 
Biomedical Research Foundation (Dayhoff, M.O., et aL, Methods Enzymol. 91:524-545 (1983)) (NBRF) 
database revealed no significant similarities to other proteins. However, a comparison to a laboratory 
database containing recently published surface proteins did reveal a surprising and significant similarity 

so between ICAM-1 and the neural cell adhesion molecule NCAM-1 (Cunningham, B.A., et aL, Scienc 236:799- 
806 (1987); Barthels, C, et aL EMBQ J. 6:907-914 (1987)). The optical alignment score obtained using the 
NBRF ALIGN program is 8 standard deviations above the mean score obtained from 500 random 
permutations of the sequences. The probability of the spontaneous occurence of an equal or higher score is 
approximately 10~ 9 . 

55 Using a database of known immunoglobulin related sequences, it has been shown that ICAM-1 may be 
divided into five Ig domains (28-112, 115-206, 217-310, 312-391, and 399-477) each of which shows 
significant similarity with other members of the Ig superfamily (Williams, A.F., Immunol. Today 8:298-3-3 
(1987)). For example, domain I is similar to CD3 whilst domains IV and V are similar to domains of myelin 
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associated glycoprotein (Arguint, M., et aj., Proc. Nati. Acad. Sci. USA 84:600-604 (1987)) and carcinoem- 
bryonic antigen (Beauchemin, N., et aL, Mol. CelL BioL 7:3221-3230 (1987)). All five Ig domains of NCAM 
align with the Ig segments in ICAM, and the principal contribution to the similarity comes from domains II 
and III of ICAM. Finally, the T cell-specific adhesion molecule CD2 shows roughly the same similarity to 
5 NCAM as does ICAM, but (CAM and CD2 are only weakly related. Thus, some precursor of NCAM is 
ancestral to both ICAM and CD2. 

The availability of a functional ICAM-1 cDNA will allow a better assessment of the role of ICAM-1/LFA-1 
mediated adhesion in antigen-specific leukocyte function, including T-cell mediated killing, T-helper re- 
sponses and antibody-dependent cell mediated killing. 

10 

EXAMPLE VIII Isolation and Molecular Cloning of the Human CD19, CD20, CD22, CDw32a, CDw32b, and 
CD40 Antigens 

15 The rapid immunoselection cloning method of the present invention was applied to isolate and clone the 
CD19, CD20, CD22, CDw32a, CDw32b, and CD40 antigens. The nucleotide sequence of CD19 is shown in 
Figure 12. The nucleotide sequence of CO20 is shown in Figure 13. The nucleotide sequence of CD22 is 
shown in Figure 14. The nucleotide sequence of CDw32a is shown in Figure 15. The nucleotide sequence 
of CDw32b is shown in Figure 16. The nucleotide sequence of CD40 is shown in Figure 17. 

20 

Equivalents 

Those skilled in the art will recognize or be able to ascertain, using no more than routine experimenta- 
25 tion, many equivalents to the specific embodiments of the invention described herein. Such equivalents are 
intended to be encompassed within the scope of this invention. 



Claims 

30 

1. A method for cloning a cDNA .encoding a cell surface antigen, comprising 

(a) preparing a cDNA library: 

(b) introducing said cDNA library into mammalian tissue culture cells; 

(c) culturing said cells under conditions allowing expression of said cell surface antigen by said cells; 
35 (d) exposing said cells to a first antibody directed against said cell surface antigen selected from the 

group consisting of CD1a, CD1b, CD1c, CD6, CD13, CD14, CD16, CD19, CD20. CD22, CD26, CD27, CD31, 
CDw32a, CDw32b, CD33, CD34, CD36, CD37, CD38, CD39, CD40, CD43, FcRIa, FcRIb, TLiSa, and Lev-8; 

(e) thereafter exposing said cells to a substrate coated with a second antibody directed against said 
first antibody, thereby causing said cells to adhere to said substrate; and 
40 (f) separating adherent from non-adherent cells. 

2. The method of claim 1 wherein said cDNA library is prepared from RNA extracted from a human cell 
line, preferably HPB-ALL or JY, or from RNA extracted from RNA extracted from human tumor cells, 
preferably human adrenal tumor cells. 

45 3. A cloned cDNA segment encoding a cell surface antigen selected from the group consisting of CD1a, 
CD1b, CD1c, CD6, CD13.CD14, CD16, CD19, CD20, CD22, CD26, CD27, CD31, CDw32a, CDw32b, CD33, 
CD34, CD36, CD37, CD38, CD39, CD40, CD43, FcRIa, FcRIb, TLiSa, and Lev-8, or a functional derivative 
thereof. 

4. A substantially pure protein selected fromt the group consisting of CD1a, CD1b, CDlc, CD6, CD13, 
so CD14, CD16, CD19, CD20, CD22, CD26, CD27, CD31, CDw32a, CDw32b, CD33, CD34, CD36, CD37, 

CD38, CD39, CD40, CD43, FcRIa, FcRIb, TLiSa, and Lev-8, or a functional derivative thereof. 

5. Substantially pure CDw32a, CDw32b, CD20, CD19, CD22 or CD40 antigen, or a functional derivative 
of these antigens. 

6. Substantially pure cDNA comprising the nucleotide sequence shown in Figure 15, Figure 17, Figure 
55 13, Figure 12, Figure 14, or in Figure 17, or a functional derivative of these cDNAs. 

7. Substantially pure protein having the amino acid sequence shown in Figure 10, or a functional 
derivative thereof. 
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8. A pharmaceutical composition comprising substantially pure cell surface antigen, especially CD1a, 
CD1b, CD1c, CD6, CD13, CD14, CD16, C019, CD20, CD22, CD26, CD27, CD31, CDw32a, CDw32b, CD33, 
CD34, CD36, CD37, CD38, CD39, CD40, CD43, FcRIa, FcRIb, TLiSa, or Lev-8, or a functional derivative of 
said cell surface antigen; said cell surface antigen present in soluble form or in the form of an acceptable 

5 salt, preferably additionally comprising an acceptable carrier and/or a therapeutic agent 

9. An immunotherapeutic agent, comprising substantially pure cell surface protein, an acceptable carrier 
and a therapeutic agent. 

10. A kit useful for immunodiagnostic assay, comprising substantially pure ceil surface antigen in 
soluble form and a containing means. 

10 
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1 GGCGTAATCT GCTGCTTGCA AACAAAAAAA CCACCGCTAC CAGCGGTGGT 

51 TTGTTTGCCG GATCAAGAGC TACCAACTCT TFTTCCGAAG GAACTGGCTT 

101 CAGCAGAGCG CAGATACCAA ATACTGTCCT TCTAGTGTAG (XGTAGTTAG 

151 GCCACCACTT CAAGAACTCT GTAGCACCGC CTACATACCT CGCTCTGCTA 

201 ATCCTGTTAC CAGTGGCTGC TGCCAGTGGC GATAAGTCGT GTCTTACCGG 

251 GTTGGACTCA AGACGATAGT TACOJGATAA GGCGCAGCGG TCGGGCTGAA 

301 CGGGGGGTTC GTGCACACAG CCCAGCTTGG AGCGAACGAC CTACACCGAA 

351 CTGAGATACC TACAGCGTGA GCTATGAGAA AGCGCCACGC TTCCCGAAGG 

401 GAGAAAGGCG GACAGGTATC CGGTAAGCGG CAGGGTCGGA ACAGGAGAGC 

451 GCACGAGGGA GCTTCCAGGG GGAAACGCCT GGTATCTTTA TAGTCCTGTC 

501 GGGTTTCGCC ACCTCTGACT TGAGCGTCGA TTTTTGTGAT GCTCGTCAGG 

551 GGGGCGGAGC CTATGGAAAA ACGCCAGCAA CGCCGAATTa ccgcggtCTT 

601 TCTCAACGTA ACAOTTACA GCGGCGCGTC AffTGATATG ATGCGCCCCG 

651 CTTCCCGATA AGGGAGCAGG CCAGTAAAAG CATTACCCGT GGTGGGGTTC 

701 CCGAGCGGCC AAAGGGAGCA GACTCTAAAT CTGCCGTCAT CGACTTCGAA 

751 GGTTCGAATC CTTCCCCCAC CACCATCACT TTCAAAAGTC CGAAAGAAtC 

801 TGCTCCCTGC TTGTGTGTTG GAGGTCGCTG AGTAGTGCGC GAGTAAAATT 

851 TAAGCTACAA CAAGGCAAGG CTTGACCGAC AATTGCATGA AGAATCTGCT 

901 TAGGGTTAGG CGTTTTGCGC TGCTTCGCGA TGTACGGGCC AGATATACGC 

951 GTTGACATTG ATTATTGACT AGTTATTAAT AGTAATCAAT TACGGGGTCA 

1001 TTAGTTCATA GCCCATATAT GGAGTTCGGC GTTACATAAC TTACGGTAAA 

1051 TGGCCCQCCT GQCTGACCGC CCAACGACCC CCGCCCATTG ACGTCAATAA 

1101 TGACGTATGT TCCCATAGTA ACGCCAATAG GGACTTTCCA TTGACGTCAA 

1151 TGGGTGGACT ATTTACGGTA AACTGCCCAC TTGGCAGTAC ATCAAGTGTA 

1201 TCATATGCCA AGTACGCCCC CTATTGACGT CAATGACGGT AAATGGCCCG 

1251 CCTGQCATTA TQCCCAGTAC ATGACCTTAT GGGACTTTCC TACTTGGCAG 

1301 TACATCTACG TAnAGTCAT CGCTATTACC ATGGTGATGC GGTTTTGGCA 

1351 GTACATCMT GGGCGTGGAT AGCGGTTTGA CTCACGGGGA TTTCCMGTC 

1401 TCCACCCCAT TGACGTCAAT GGGAGTTTGT TTTGGCACCA AAATCAACGG 

1451 GACTTTCCAA MTGTCGTAA CAAaCCGCC CCATTGACGC AAATGGGCGG 

1501 AATTCCTGGG CGGGACTGGG GAGTGGCGAG CCCTCAGATG CTGCATATAA 

1551 GCAGCTGCTT TTTGCCTGTA CTGGGTCTCT CTQGTTAGAC CAGATCTGAG 

1601 CCTGGGAGCT CTCTGGCTAA CTAGAGAACC CACTGCTTAA GCCTCMTM 

1651 AGCTTCTAGA GATCCCTCGA CCTCGAGGGA TCTTCCATAC CTACCAGTTC 
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CCTMGATGAQCTTTCCA^^^ (60) 

METSERPHEPROCYSLYSpHEVALALASERPhELEULEUlLEPHEASNVALSER 

TCCAMGGT&AGTCTCCAMGAGATT^^ (120) 
SerLys&yAuValSbrLysGluI^^ 

CAGGACATCMCWGGACAiTCCTAGTTTtCAMTGAGTGATGATAnGACGATAT (180) 
20 GlnAspIleAsnLeuAspIleProSerFIheGlnMetSerAspAspIleAspAspIleLys 

TGGGAAAAMCnCAGACMGAAAMGATTGCACM^ (240) 
40 TrpGluLysThrSerAsfIysLysLys^ 

MGGAAAMGATACATATAAGCTATTTAAAM^ (300) 
60 Lys(LuLysAspThrTyrLysLeiPheLysAsnGlyThrI^ysIleLysHisLelLys 

. — CHO — . 

ACCGATOTCAGGATATC^ (360) 
80 ThrAspAsKLnAspIleTyrLy^^ 

GAAAAMTATnGATnGMGAnCMGAGAGGGTCTCAAMCCAM(^TCTCCTGGACT (420) 
100 GluLysIlePheAspLeuLysIleGlnGluArgValSerLysProLysIleSerTrpThr 

TGTATCMCACMCCCTGACCTGTGAGGTAATGAATGGAACTGACCCCGAaTTAAACCTG (480) 
120 CysIleAsnThrThrI^ThrCysGluValI^AsnGlyThrAspProGllIbjAsnLeu 
— CHO — — CHO — 

TATCAAGATGGGAAACATCTAAMCTTTCTCAGAGGGTCATCACACACAAGTGGACCACC (540) 
140 Ty^lnAspGlyLysHisLeuLysLeuSer(LnArgValIleThrHisLysTrpThrThr 

A(m(^<XAAMFCMGm (600) 

160 SERl^USERALALYSFteLYSCYSTHRALAGLYAsrlYSVALSERLYSGLUSERSERVAL 

GAGCCTGTCAa:TGTCCAGAGAMGGTCT^ACATCTATCTCATCATTGGCATA^ (660) 

180 GLUPROVALSERCYSFftoGL^ 

GGAGGCAGCCTCnGATGGtCTTTGTG(X^ACTGCTCGTTTTCTATATCACCAAAAGGAAA (720) 
200 GlyGlySerI^euMeWal^ValAlaLeuLeuValPheTyrI 
-j^ 

AMCAGAGGAGTCGGAGAMTGATGAGGA(XTGGAGACMGA(XCCACAGAGTAGCTACT (780) 
220 LysGij4ArgSerArgAr<^nAspGluGluLeuGluThr^ 

GMGAMGG^CGGMGCCCCMCAMTtCCAGCnCAAcCCCTCAGAATCCAGCAACt (840) 
240 GluGluArgGlyArgLysProGlnGlnIleProAlaSerThrProGlnAsnProAlaThr 

TCCCMCATCCTCCTCCACCACCTOiTCAtCGnCCCAG(XACCTAGTCATCGTCCCCCG (900) 
260 SerGlnHis^oProF^oF^oF^oGlyHisAr(^rGlnAlaRroSerHisArgProPro 

CCTCCTGGACACCGTGTTCAGCACCAGCCtCA(W\GAGGCCTCCTGCTCCGTCGGGCACA (960) 
280 ProProGlyHisArgValGlnHisGlnProGlnLysArgProProAlaProSerGlyThr 
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CAAGTTCACWQCAfiAMGKCCGCCCCTCCCCAGACCTCGAGTTCAGCCAAAACCTCCC 
300 GlnValHisGlnGl^ysGlyProProLeiProArgProArgValGln^oLysProPro 

CATGGGGCAGCAGAMACTCAnGTCCCCiTC 
320 HisGlyAlaAlaGluAsnSerLeuSerProSerSerAsinEnd 

TTnCMTAAAMGCACTGtGGATTTCTGCCCTCCTGATGTGCATATCCGTACTTC 

AGGTGTTTTCTGTGTGCAGAACAnGTCACCTCCTGAOXTGTGGGCCACAGCCACCT^ 

GCATCnCGAACTttQCCATGTGGTCIMCATCTGGAGTllTrGGTCTCCtCAGAGAGCTC 

{^TCACACCAGTMGGAGAAGCMTATAAGTGTGAnGCMGMTGGTAGAGGACCGAGC 

ACAGAMTCtTAGAGATncnGTCCCCTCTCAGGTCATGTGTAGATGCGATAMTW 

TGAnGGTGtGCCTGGGTCtCACTACMGCAGCCTATCTGCnAAGAGACTCTGGAGTTt 

CTTATGT(X:CCTGGTGGACACn(3CCCACCATCCTGTGAGTAiAA^ 

TGAC (1504) 
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1 QGCGTMTCT QCTGCTTGCA AACAAAAAAA CCACCGCTAC CAGCGGTGGT 

51 TTGTTTGCCG GATCAAGAGC TACCAACTCT TTTTCCGAAG GTAACTGGCT 

101 TCAGCAGAGC GCAGATACCA AATACTGTCC TTCTAGTGTA GCCGTAGTTA 

151 GGCCACCACT TCAAGAACTC TGTAGCACCG CCTACATACC TCGCTCTGCT 

201 AATCCTGTTA CCAGTGQCTG CTGCCAGTGG CGATAAGTCG TGTCTTACCG 

251 GGTTGGACTC AAGACGATAG TTACCGGATA AGGCGCAGCG GTCGGGCTGA 

301 ACGGGQGGTT CGTGCACACA GCCCAGCTTG GAGCGAACGA CCTACACCGA 

351 ACTGAGATAC CTACAGCGTG AGCATTGAGA AAGCGCCACG CTTCCCGAAG 

401 GGAGAAAGGC GGACAGGTAT CCGGTAAGCG GCAGGGTCGG AACAGGAGAG 

451 CGCACGAGGG AGC7TCCAGG GGGAAACGCC TGGTATCTTT ATAGTCCTGT 

501 CGGGTTTCGC CACCTCTGAC TTGAGCGTCG ATTTTTGTGA TGCTCGTCAG 

551 GGGGGCGGAG CCTATGGAAA AACGCCAGCA ACGCAAGCTA GCTTCTAGCT 

601 AGAAATTGTA AACGTTAATA TTTTGTTAAA ATTCGCGTTA AATTTTTGTT 

651 AAAiCAGCTC ATTTnTAAC CAATAGGCCG AAATCGGCAA AATCCCTTAT 

701 AAATCAAAAG AATAGCCCGA GATAGGGTTG AGTGTTGTTC CAGTTTGGAA 

751 CAAGAGTCCA CTATTAAAGA ACGTGGACTC CAACGTCAAA GGGCGAAAAA 

801 CCGTCTATCA GGGCGATGGC CGCCCACTAC GTGAACCATC ACCCAMTCA 

851 AGTTTTTTGG GGTCGAGGTG CCGTAAAGCA CTAAATCGGA ACCCTAAAGG 

901 GAGCCCCCGA TTTAGAGCTT GACGGGGAM GCCGGCGAAC GTGGCGAGAA 

951 AGGAAGGGAA GAAAGCGAAA GGAGCGGGCG CTAGGGCGCT GGCAAGTGTA 

1001 GCGGTCACGC TGCGCGTAAC CACCACACCC GCCGCGCTTA ATGCGCCGCT 

1051 ACAGGGCGCG TACTATGGTT GCTiTGACGA GCACGTATAA CGTGC7TTCC 
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1101 TCGTTGGAAT CAGAGCGQGA GCTAAACAGG AGQCCGATTA AAGGGATTTT 

1151 AGACAGGAAC GGTACGCCAG CTGGATCACC GCGGTCTTTC TCAACGTAAC 

1201 ACTTTACAGC GGCGCGTCAT TTGATATGAT GCGCCCCGCT TCCCGATAAG 

1251 GGAGCAGGCC AGTAAAAGCA TTACCCGTGG TGGGGTTCCC GAGCGGCCAA 

1301 AGGGAGCAGA CTCTAAATCT GCCGTCATCG ACTTCGAAGG TTCGAATCCT 

1351 TCCCCCACCA CCATCACTTT CAAAAGTCCG AAAGAAlCTG CTCCCTGCTT 

1401 GTGTGTTGGA GGTCGCTGAG TAGTGCGCGA GTAAAATTTA AGCTACAACA 

1451 AGGCAAGGCT TGACCGACAA TTGCATGAAG AATCTGCTTA GGGTTAQGCG 

1501 TTTTQCQCTG C7TCGCGATG TACQGGCCAG ATATACGCGT TGACATTGAT 

1551 TATTGACTAG TTATTAATAG TAATCAATTA CGGGGTCATT AGTTCATAGC 

1601 CCATATATQG AGTTCCGCGT TACATAACTT ACGGTAAATG GCCCGCCTGG 

1651 CTGACCGCCC AACGACCCCC GCCCATTGAC GTCAATAATG ACGTATGTTC 

1701 CCATAGTAAC GCCAATAGQG ACTTTCCATT GACGTCAATG GGTGGACTAT 

1751 TTACGGTAAA CTGCCCACTT GGCAGTACAT CAAGTGTATC ATATGCCAAG 

1801 TACGCCCCCT ATTGACGTCA ATGACGGTAA ATGGCCCGCC TGGCATTATG 

1851 CCCAGTACAT GACGTATGG GACTTTCCTA CTTGGCAGTA CATCTACGTA 

1901 TTAGTCATCG CTATTACCAT GGTGATGCGG TTTTGGCAGT ACATCAATGG 

1951 GCGTQGATAG CGGTTTGACT CACGGGGATT TCCAAGTCTC CACCCCAnG 

2001 ACGTCAATQG GAGTTTGTTT TGGCACCAAA ATCAACGGGA CTTTCCAAAA 

2051 TGTCGTMCA ACTCCGCCCC ATTGACGCAA ATGGGCGGAA TTCCTGQGCG 

2101 GGACTGQGGA GTGGCGAGCC CTCAGATGCT GCATATAAGC AGCTGCTTTT 

2151 TGCCTGTACT GGGTCTCTCT GGTTAGACCA GATCTGAGCC TGGGAGCTCT 

2201 CTGGCTAACT AGAGAACCCA CTGCTTMGC CTCAATAAAG CTTCTAGAGA 

2251 TCCCTCGACC TCGAGATCCA TTGTGCTGGC GCGGATTCTT TATCACTGAT 

FIG. 6-2 
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CCCAATCATG 


AGGAAGATGC 


1601 


AGACTCTTAT 


GAGAACATGG 


ATAATCCCGA 


TGGGCCAGAC 


CCAGCCTGGG 


1651 


GAGGAGGGGG 


CCGCATGGGC 


ACCTGGAGCA 


CCAGGTGATC 


CTCAGGTGGC 


1701 


CAGCCTGGAT 


CTCCTCAAGT 


CCCCAAGATT 


CACACCTGAC 


TCTGAAATCT 


1751 


GAAGACCTCG 


AGCAGATGAT 


GCCAACCTCT 


GGAGCAATGT 


TGCTTAGGAT 


1801 


GTGTGCATGT 


GTGTAAGTGT 


GTGTGTGTGT 


GTGTGTGTGT 


GTGTGTGTGT 


1851 


ATACATGCCA 


GTGACACTTC 


CAGTCCCCTT 


TGTATTCCTT 


AAATAMCTC 


1901 


AATGAGCTCT 


TCCAAAAAAA 


AAAA 
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1 


ACAAAGACM 


ACTGCACCCA 


CTGAACTCCG 


CAGCTAGCAT 


CCAAATCAGC 


51 


CCTTGAGATT 


TGAGGCCTTG 


GAGACTCAGG 


AGTTTTGAGA 


GCAAAATGAC 


101 


AACACCCAGA 


AATTCAGTAA 


ATGGGACTTT 


CCCGGCAGAG 


CCAATGAAAG 


151 


GCCCTATTGC 


TATGCAATCT 


GGTCCAAAAC 


CACTCTTCAG 


GAGGATGTCT 


201 


TCACTGGTGG 


GCCCCACGCA 


AAGCTTCTTC 


ATGAGGGAAT 


CTAAGACTTT 


251 


GGGGGCTGTC 


CAGATTATGA 


ATGGGCTCTT 


CCACATTGCC 


CTGGGGGGTC 


301 


TTCTGATGAT 


CCCAGCAGGG 


ATCTATGCAC 


CCATCTGTGT 


GACTGTGTGG 


351 


TACCCTCTCT 


GGGGAGGCAT 


TATGTATATT 


ATTTCCGGAT 


CACTCCTGGC 


401 


AGCAACGGAG 


AAAAACTCCA 


GGAAGTGTTT 


GGTCAAAUGA 


AAAATGATAA 


451 


TGAATTCAF 


GAGCCTCTTT 


GCTGCCATTT 


CTGGAATGAT 


TCTTTCAATC 


501 


ATGGACATAC 


TTAATATTAA 


AATTTCCCAT 


IIIIIAAAAA 


TGGAGAGTCT 


551 


GAATTTTATT 


AGAGCTCACA 


CACCATATAT 


TAACATATAC 


AACTGTGAAC 


601 


CAGCTAATCC 


CTCTGAGAAA 


AACTCCCCAT 


CTACCCAATA 


CTGTTACAGC 


651 


ATACAATCTC 


TGTrCTTGGG 


CATTTTGTCA 


GTGATGCTGA 


TCTTTGCCTT 


701 


C7TCCAGGAA 


CTTGTAATAG 


CTGGCATCGT 


TGAGAATGAA 


TGGAAAAGAA 


751 


CGTGCTCCAG 


ACCCAAATCT 


AACATAGTTC 


TCCTGTCAGC 


ACAAGAAAAA 


801 


AAAGAACAGA 


CTATTGAAAT 


AAAAGAAGAA 


GTGGTTGGGC 


TAACTGAAAC 


851 


ATCTTCCCAA 


CCAAAGAATG 


AAGAAGACAT 


TGAAATTATT 


CCAATCCAAG 


901 


MGAGGAAGA 


AGAAGAAACA 


GAGACGAACT 


TTCCAGAACC 


TCCCCAAGAT 


951 


CAGGAATCCT 


CACCAATAGA 


AAATGACAGC 


TCTCCTTAAG 


TGATTTCnC 


1001 


TGTTTTCTGT 


TTCCIIIIII 


AAACATTAGT 


GTTCATAGCT 


TCCAAGAGAC 


1051 


ATGCTGACTT 


TCATTTCTTG 


AGGTACTCTG 


CACATACGCA 


CCACATCTCT 
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1101 ATCTGGCCTT TGCATGGAGT GACCATAGCT CCTTCTCTCT TACATTGAAT 

1151 GTAGAGAATG TAGCCATTGT AGCAGCTTGT GTTGTCACGC TTCTTCTTTT 

1201 GAGCAACTTT CTTACACTGA AGAAAGGCAG AATGAGTGCT TCAGAATGTG 

1251 ATTTCCTACT AACCTGTTCC TTGGATAGGC TTTTTAGTAT AGTATTTTTT 

1301 TTTGTCATTT TCTCCATCAG CAACCAGGGA GACTGCACCT GATGGAAAAG 

1351 ATATATGACT GOTCATGAC ATTCCTAAAC TATCTTFTTT TTATTCCACA 

1401 TCTACGTTTT TGGTGGAGTC CCTTTTTATC ATCCTTAAAA CAATGATGCA 

1451 AAAGGGCTTT AGAGCACAAT GGATCT 



FIG. 13-2 
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1 ACGCGGAAAC AGGCTTGCAC CCAGACACGA CACCATGCAT CTCCTCGGCC 

51 CCTGGCTCCT QGTCCTQGTT CTAGAATACT TGGCTTTCTC TGACTCAAGT 

101 AAATGQGTTT TTGAGCACCC TGAAACCCTC TACGCCTGGG AGGGGGCCTG 

151 CGTCTQGATC CCCTGCACCT ACAGAGCCCT AGATGGTGAC CTGGAAAGCT 

201 TCATCCTGTT CCACAATCCT GAGTATAACA AGAACACCTC GAAGTTTGAT 

251 GGGACAAGAC TCTATGAAAG CACAAAGGAT GGGAAGGTTC CTTCTGAGCA 

301 GAAAAGGGTG CAATTCCTGG GAGACAAGAA TAAGAACTGC ACACTGAGTA 

351 TCCACCCGGT GCACCTCAAT GACAGTGGTC AGCTGGGGCT GAGGATGGAG 

401 TCCAAGACTG AGAAATGGAT GGAACGAATA CACCTCAATG TCTCTGAAAG 

451 GCCTTTTCCA CCTCATATCC AGCTCCCTCC AGAAA7TCAA GAGTCCCAGG 

501 AAGTCACTCT GACCTGCTTG CTGAATTTCT CCTGCTATGG GTATCCGATC 

551 CAATTGCAGT GGCTCCTAGA GGGGGTTCCA ATGAGGCAGG CTGCTGTCAC 

601 CTCGACCTCC TTGACCATCA AGTCTGTCTT CACCCGGAGC GAGCTCAAGT 

651 TCTCCCCACA GTGGAGTCAC CATGGGAAGA TTGTGACCTG CCAGCTTCAG 

701 GATGCAGATG GGAAGTTCCT CTCCAATGAC ACGGTGCAGc TgAACGTGAA 

751 GCATCCTCCC AAGAAGGTGA CCACAGTGAT TCAAAACCCC ATGCCGATTC 

801 GAGAAGGAGA CACAGTGACC CTTTCCTGTA ACTACAATTC CAGTAACCCC 

851 AGTGTTACCC GGTATGAATG GAAACCCCAT GGCGCCTGGG AGGAGCCATC 

901 GCTTGGGGTG CTGAAGATCC AAAACGTTGG CTGGGACAAC ACAACCATCG 

951 CCTGCGCAGC TTGTAATAGT TGGTGCTCGT GGGCCTCCCC TGTCGCCCTG 

1001 AATGTCCAGT ATGCCCCCCG AGACGTGAGG GTCCGGAAAA TCAAGCCCCT 

1051 TTCCGAGATT CACTCTGGAA ACTCGGTCAG CCTCCAATGT GACTTCTCAA 

1101 GCAGCCACCC CAAAGAAGTC CAGTTCTTCT GGGAGAAAAA TGGCAGGCTT 

1151 CTGGGGAAAG AAAGCCAGCT GAATTTTGAC TCCATCTCCC CAGAAGATGC 

1201 TGGGAGTTAC AGCTGCTGGG TGAACAACTC CATAGGACAG ACAGCGTCCA 

1251 AGGCCTGGAC ACTTGAAGTG CTGTATGCAC CCAGGAGGCT GCGTGTGTCC 

1301 ATGAGCCCGG GGGACCAAGT GATGGAGGGG AAGAGTGCAA CCCTGACCTG 

1351 TGAGAGCGAC GCCAACCaC CCGTCTCCCA CTACACCTGG TTTGACTGGA 

1401 ATAACCAAAG CCTCCCCTAC CACAGCCAGA AGCTGAGATT GGAGCCGGTG 

1451 AAGGTCCAGC ACTCGGGTGC CTACTGGTGC CAGGGGACCA ACAGTGTGGG 

1501 CAAGGGCCGT TCGCCTCTCA GCACCaCAC CGTCTACTAT AGCCCGGAGA 

1551 CCATCGGCAG GCGAGTGGCT GTGGGACTCG GGTCCTGCCT CGCCATCCTC 

1601 ATCCTGGCAA TCTGTGGGCT CAAGCTCCAG CGACGTTGGA AGAGGACACA 

1651 GAGgCCAGCA GGGGCTTCAG GAGAATTCCA GCGGCCAGAG CTTCTTTGTG 

FIG. 14-1 
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1701 AGGAATAAAA AGGTTAGAAG GGCCCCCCTC TCTGAAGGCC CCCACTCCCT 

1751 GGGATGCTAC AATCCAATGA TGGAAGATcG CATTAGCTAC ACCACCCTGC 

1801 gCTTTCcCGA GATGAACATA GCACGAACTG GAGATGCAGA GTCCTCAGAG 

1851 ATGCAGAGAC CTCCCCCGGA CTGCGATGAC ACGGTCACTT ATTCAGCATT 

1901 GCACAAGCGC CAAGTGGGCA CTATGAGAAC GTCATTCCAG ATTTTCCAGA 

1951 AGATGAGGGG ATTCATTACT CAGAGCTGAT CCAGTTTGGG GTCGGGGAGC 

2001 GGCCTCAGGC ACAAGAAAAT GTGGACTATG TGATCCTCAA ACATTGACAT 

2051 GGATGGGCTG CAGcAGAGGC ACTGGGGGCA GCGGGGQCCA GGGAAGTcCC 

2101 CGAGTTT 



FIG. 14-2 
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1 CCCAAATGTC TCAGAATGTA TGTCCCAGM ACCTGTGGCT QCTTCAACCA 

51 TTGACAGTTT TGCTGCTGCT GGCTTCTGCA GACAGTCAAG CTGCAGCTCC 

101 CCCAAAGGCT GTQCTGAMC TTGAQCCCCC GTGGATCAAC GTGCTCCAGG 

151 AGGACTCTGT GACTCTGACA TGCCAGGGGG CTCQCAGCCC TGAGAGCGAC 

201 TCCATTCAGT GGTTCCACAA TGGGAATCTC ATTCCCACCC ACACGCAGCC 

251 CAGCTACAGG TTCAAGGCCA ACAACAATGA CAGCGGGGAG TACACGTGCC 

301 AGACTGGCCA GACCAGCCTC AGCGACCCTG TQCATCTGAC TGTGCTTTCC 

351 GAATGGCTGG TGCTCCAGAC CCCTCACCTG GAGTTCCAGG AGGGAGAAAC 

401 CATCATGCTG AGGTGCCACA GCTGGAAGGA CAAGCCTCTG GTCAAGGTCA 

451 CATTCTTCCA GAATGGAAAA TCCCAGAAAT TCTCCCGTTT GGATCCCACC 

501 TTCTCCATCC CACAAGCAAA GCACAGTCAC AGTGGTGATT ACCACTGCAC 

551 AGGAAACATA GGCTACACGC TGTTCTCATC CAAGCCTGTG ACCATCACTG 

601 TCCAAGTGCC CAGCATGGGC AGCTCTTCAC CAATGGGGAT CATTGTGGCT 

651 GTGGTCATTG CGACTGCTGT AGCAGCCATT GTTGCTGCTG TAGTGGCCTT 

701 GATCTACTGC AGGAAAAAGC GGATTTCAGC CAATTCCACT GATCCTGTGA 

751 AGGCTGCCCA ATTTGAGCCA CCTGGACGTC AAATGATTGC CATCAGAAAG 

801 AGACAACTTG AAGAAACCAA CAATGACTAT GAAACAGCTG ACGGCGGCTA 

851 CATGACTCTG AACCCCAGGG CACCTACTGA CGATGATAAA AACATCTACC 

901 TGACTCTTCC TCCCAACGAC CATGTCAACA GTAATAACTA AAGAGTAACG 

951 TTATGCCATG TGGTCATACT CTCAGCTTGC TGAGTGGATG ACAAAAAGAG 

1001 GGGAATTGTT AAAGGAAAAT TTAAATQGAG ACTGGAAAAA TCCTGAGCAA 

1051 ACAAMCCAC CTGGCCCTTA GAAATAQCTT TAACTTTGCT TAAACTACAA 

1101 ACACAAGCAA AACTTCACGG GGTCATACTA CATACAAGCA TAAGCAAAAC 

1151 TTAACTTGGA TCATTTCTGG TAAATGCTTA TGTTAGAAAT AAGACAACCC 

1201 CAGCCAATCA CAAGCAGCCT ACTAACATAT AATTAGGTGA CTAGGGACTT 

1251 TCTAAGAAGA TACCTACCCC CAAAAAACAA TTATGTAATT GAAAACCAAC 

1301 CGATTGCCTT TATTTTGCTT CCACATTTTC CCAATAAATA CTTGCCTGTG 

1351 ACATTTTGCC ACTGGAACAC TAAACTTCAT GAATTGCGCC TCAGATTTTT 

1401 CCTTTAACAT Clllllllll TTTGACAGAG TCTCAATCTG TTACCCAGGC 

1451 TGGAGTGCAG TGGTGCTATC TTGGCTCACT GCAAACCCGC CTCCCAGGTT 

1501 TAAGCGATTC TCATGCCTCA GCCTCCCAGT AGCTGGGATT AGAQGCATGT 

1551 GCCATCATAC CCAGCTAATT TTTGTATTTT TTAI 1 1 1 1 1 1 TTTTTAGTAG 

1601 AGACAGGGTT TCGCAATGTT GGCCAGGCCG ATCTCGAACT TCTGGCCTCT 

1651 AGCGATCTGC CCGCCTCGGC CTCCCAAAGT GCTGGGATGA CCAGCATCAG 

FIG. 15-1 
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1701 CCCCAATGTC CAQCCTCTTT AACATCTTCT TTCCTATGCC CTCTCTGTGG 

1751 ATCCCTACTG CTGGTTTCTG CCTTCTCCAT GCTGAGAACA AAATCACQA 

1801 TTCACTGCTT ATGCAGTCGG AAGCTCCAGA AGAACAAAGA GCCCAATTAC 

1851 CAGAACCACA TTAAGTCTCC ATTGTTTTGC CTTGGGATTT GAGAAGAGAA 

1901 TTAGAGAGGT GAGGATCTGG TATTTCCTGG ACTAAATTCC CCTTGGQGAA 

1951 GACGAAGGGA TGCTGCAGTT CCAAAAGAGA AGGACTCTTC CAGAGTCATC 

2001 TACCTGAGTC CCAAAGCTCC CTGTCCTGAA AGCCACAGAC AATATGGTCC 

2051 CAAATGACTG ACTGCACCTT CTGTGCCTCA GCCGTTCTTG ACATCAAGAA 

2101 TCTTCTGTTC CACATCCACA CAGCCAATAC AATTAGTCAA ACCACTGTTA 

2151 TTAACAGATG TAGCAACATG AGAAACGCTT ATGTTACAGG TTACATGAGA 

2201 GCAATCATGT AAGTCTATAT GACTTCAGAA ATGTTAAAAT AGACTAACCT 

2251 CTAACAACAA ATTAAAAGTG ATTGTTTCAA GGTGAAAAAA 



FIG. 15-2 
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1 QCTGTGACTG CTGTGCTCTG GGCGCCACTC GCTCCAGGGA GTGATGGGAA 

51 TCCTGTCATT CTTACCTGTC CTTGCCACTG AGAGTGACTG GGCTGAcTgC 

101 AAGTCCCCCC AGCCTTGGGG TCATATGCTT CTGTGGACAG CTGTGCTATC 

151 CCTGGCTCCT GTTGCTQGGA CACCTGCAGC TCCCCCAAAG GCTGTGCTGA 

201 AACTCGAGCC CCAGTGGATC AACGTGCTCC AGGAGGACTC TGTGACTCTG 

251 ACATGCCGGG GGACTCACAG CCCTGAGAGC GACTCCATTC AGTGGTTCCA 

301 CAATGGGAAT CTCATTCCCA CCCACACGCA GCCCAGCTAC AGGTTCAAGG 

351 CCAACAACAA TGACAGCGGG GAGTACACGT GCCAGACTGG CCAGACCAGC 

401 CTCAGCGACC CTGTGCATCT GACTGTGCTT TCTGGTCAGT GGAGGAAGGC 

451 CCCAGGGTGG ACCTGGGAGG GCCAGGACGG ATGAAATCTG CTTTCAGGCA 

501 GAGGTTTGCA GGAAAGGGQG GTGGCCTGCT TACTGGGAAG TATCGCTGTG 

551 AGTTGCCTCA GCACATATCA GTGGTTGTTT TTGCCTCAGT TCTGATTGAA 

601 CAGAAGAAGG TTTCAAGGCC AAAAACAGQC AGCCAAGTGT GAGAGAAGCA 

651 GAAGGAAATC CCTACTGCAT AAAACCCATT TCCATTTTAA TGGCAGAATT 

701 GAAMGCACA GACCACAACT GAATCCTAGC CCTGGAMTG ACTCACTATA 

751 CAACATGATG AATTCATTTA ACCCTTGAGT TTCCATTTCT TCACCTGCTC 

801 CGTGGGGCAC TAACGCCTCC CTCAGAGGCT TCTGGTGAGA ATCAGTGTTT 

851 CCCTGCCCCC GCCCCGCCCT CCATGCCCCT TCTCCACGTT CTCACTGTGC 

901 TAGGTGCTCT TCTCTGTCTT TCTCTTCCAC CAGCCTGTGG GAAACCTGAG 

951 ATGAAAGTCG TGTCTTACCC ATCTTTGTAT TTCCAGCATC TGAAACTGGG 

1001 CAGAGCTTAA TAAATATTTT GCTGGAGAGG TTGATGATCT TACAAAGCTC 

1051 CCATTGAAAG GTGGCTCTCT GTAAAGCAAA GTTACAATGA GATTGTGATG 

1101 AACATTGTCC TTGTGGCTTT TCACTTAGTC CCCTCCCnC ACCTGAAGAG 

1151 CAAATTTTCC TCAAAAGTAC ACAGCAAACG AATGACCCAC TGGTGACACT 

1201 GTTGCCTTTA GACCCTGCTG GAAAGAAGCT CCACATTTAT TAACATTCCC 

1251 GAAGTAAATT TATCAGGTAG CATTCATCAG GTAACATTTG TTGCACATTC 

1301 ATGACTTTTC TACTGTCCAC AAAGGCATAT GTCCTTATCA TATGCGGACT 

1351 CCTCGGTCAC ACTGGATTCT TCCTTCCCTC CTCGACATGG AAGAGATGGC 

1401 ATCTTAGGGT CTCTTGTGTT CTTCCTGCAG AGGCCTGTCG GGCAGGAAAA 

1451 GGCTGCAGCT QCCTTCCTGG GAGAAGGAGG AGATGAGTGT ATCCTGAACA 

1501 CCTATTATGT QCTAGGGGCT ATTGTAGATA CATGACACTA TCATGCTCAT 

1551 TTTCACGAAT GAGGAAACTG AGGCTCAGM GACTTAAATT ATTTGCCCAA 

1601 GAGTTATAAA TGACAGAGCC AGCATTAGAG TCCAGGACTG TCTGATTTCA 

1651 GACCTAAGCT GTTCCCTCTG CACATCGTGT CCCACCAGTA AGGAAGATCT 



FIG. 16-1 
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1701 GGGTCTCAGA GCTGAGCCAA GACCTCCCGG GTCCTCTGCG GTTTTTTGTG 

1751 TCTTTCAGAG TGGCTGGTGC TCCAGACCCC TCACCTGGAG TTCCAGGAGG 

1801 GAGAAACCAT CGTGCTGAGG TGCCACAGCT GGAAGGACAA GCCTCTGGTC 

1851 AAGGTCACAT TCTTCCAGAA TGGAAAATCC AAGAAATTTT CCCGTTCGGA 

1901 TCCCAACTTC TCCATCOCAC AAGCAAACCA CAGTCACAGT GGTGATTACC 

1951 ACTGCACAGG AAACATAGGC TACACGCTGT ACTCATCCAA GCCTGTGACC 

2001 ATCACTGTCC AAGCTCCCAG aOTCACCG ATGGGGATCA TTGTGGCTGT 

2051 GGTCACTQGG ATTGCTGTAG CQGCCATTGT TQCTGCTGTA GTGGCCTTGA 

2101 TCTACTGCAG GAAAAAGCGG ATTTCAGGTT TGTAGCTCCT CCCGGTCCCT 

2151 TTTGTTATCA GTTTCCACTT T 



FIG. 16-2 
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1 GCCTCGCTCG GGCGCCCAGT QGTCCTGCCG CCTQGTCrCA CCTCGCCATG 

51 GTTCGTCTGC CTCTGCAGTG GGTCCTCTGG GGCTGCTTGC TGACCGCTGT 

101 CCATCCAGAA CCACCCACTG CATGCAGAGA AAAACAGTAC CTAATAAACA 

151 GTCAGTQCTG TTGTTGTGC CAGCCAGGAC AGAAACTGGT GAGTGACTGC 

201 ACAGAGTTCA CTGAAACGGA ATGCCTTCCT TGCGGTGAAA GCGAATTCCT 

251 AGACACCTGG AACAGAGAGA CACACTGCCA CCAGCACAAA TACTGCGACC 

301 CCAACCTAGG GCTTCQGGTC CAGCAGAAGG GCACCTCAGA AACAGACACC 

351 ATCTGCACCT GTGAAGAAGG CTGGCACTGT ACGAGTGAGG CCTGTGAGAG 

401 CTGTGTCCTG CACCGCTCAT GCTCGCCCGG CTTTGGGGTC AAGCAGATTG 

451 CTACAGGGGT TTCTGATACC ATCTGCGAGC CCTGCCCAGT CGGCTTCTTC 

501 TCCAATGTGT CATCTQCTTT CGAAAAATGT CACCCTTGGA CAAGCTGTGA 

551 GACCAAAGAC CTGGTTGTGC AACAGGCAGGC ACAAACAAGA CTGATGTTGT 

601 CTGTGGTCCC CAGGATCGGC TGAGAGCCCT GGTGGTGATC CCCATCATCT 

651 TCGGGATCCT GTTTGCCATC CTCTTGGTGC TGGTCTTTAT CAAAAAGGTG 

701 GCCAAGAAGC CAACCAATAA GGCCCCCCAC CCCAAGCAGG AACCCCAGGA 

751 GATCAATTTT CCCGACGATC TTCCTGGCTC CAACACTGCT GCTCCAGTGC 

801 AGGAGACTTT ACATGGATGC CAACCGGTCA CCCAGGAGGA TGGCAAAGAG 

851 AGTCGCATCT CAGTGCAGGA GAGACAGTGA GGCTGCACCC ACCCAGGAGT 

901 GTGGCCACGT GGGCAAACAG GCAGTTGGCC AGAGAGCCTG GTGCTGCTGC 

951 TGCAGGGGTG CAGGCAGMG CGGGGAGCTA TGCCCAGTCA GTGCCAGCCC 

ac 



FIG. 17 



